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NOVEL HUMAN TRANSPORTER PROTEINS AND 
POLYNUCLEOTIDES ENCODING THE SAME 

5 The present application claims the benefit of U.S. 

Provisional Application Nvunber 60/210,045 which was filed on 
June 7, 2000 and is herein incorporated by reference in its 
entirety. 

1 . INTRODUCTION 

10 The present invention relates to the discovery, 

identification, and characterization of novel human 
polynucleotides encoding proteins that share sequence 
siitdlarity with mammalian transporter proteins- The 
invention encompasses the described polynizcleo tides, host 

15 cell expression systems, the encoded proteins, fusion 
■ proteins, polypeptides and peptides, antibodies to the 
encoded proteins and peptides, and genetically engineered 
animals that . either lack or over express the disclosed 
polynucleotide sequences, antagonists and agonists of the 

20 proteins, and other compounds that modulate the expression or 
activity of the proteins encoded by the disclosed 
polynucleotide sequences that can be used for diagnosis, drug 
screening, clinical trial monitoring, the treatment of 
diseases and disorders, and cosmetic or nutriceutical 

25 applications. 

2. BACKGROUND OF THE INVENTION 
Transporter proteins are integral membrane proteins 
that mediate or facilitate the passage of materials across 
the lipid bilayer. Given that the transport of materials 
30 across the membrane can play an important physiological role, 
transporter proteins are good drug targets. Additiona^lly, 
one of the mechanisms of drug resistance involves diseased 
cells using cellular transporter systems to export 
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chemotherapeutic agents from the cell. Such mechanisms are 
particularly relevant to cells manifesting resistance to a 
multiplicity of drugs. 

3. SUMMARY OF THE INVENTION 
5 The present invention relates to the discovery/ 

identification, and characterization of nucleotides that 
encode novel human proteins, and the corresponding amino acid 
sequences of these proteins. The novel human proteins (NHPs) 
described for the first time herein share structural 

10 similarity with mamonnalian ion transporters, sulfate 

transporters, and particularly the sulfate transporter that 
has been associated with diastrophic dysplasia. 

The novel human nucleic acid sequences described 
herein, encode alternative proteins /open reading frames 

15 (ORFs) of 679, 621, 663, 605, 656, and 598 amino acids in 

length (see respectively SEQ ID NOS: 2, 4, 6, 8, 10, and 12. 
The invention also encompasses agonists and antagonists of 
the described NHPs, including small molecules, large 
molecules, mutant NHPs, or portions thereof, that compete 

20 with native NHP, peptides, and antibodies, as well as 
nucleotide sequences that can be used to inhibit the 
expression of the described NHPs (e.g., antisense and 
ribozyme molecules, and gene or regulatory sequence 
replacement constructs) or to eiihance the expression of the 

25 described NHP polynucleotides (e.g., expression constructs 
that place the described polynucleotide under the control of 
a strong promoter system) , and transgenic animals that 
express a NHP transgene, or "knock-outs" (which can be 
conditional) that do not express a functional NHP. Knock-out 

30 mice can be produced in several ways, one of which involves 
the use of mouse embryonic stem cells ("ES cells") lines that 
contain gene trap mutations in a imirine homolog of at least 
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one of the described NHPs. When the unique NHP sequences 

described in SEQ ID N0S:1-13 are *knocked-out* they provide a 

method of identifying phenotypic esqpression of the particular 

gene as well as a method of assigning function to previously 

5 iinknown genes. Additionally, the \xnique NHP sequences 

described in SEQ ID N0S:1-13 are useful for the 

identification of coding sequence and the mapping a \mique 

gene to a particular chromosome. 

Further, the present invention also relates to 

10 processes for identifying compoimds that modulate, i.e., act 

as agonists or antagonists, of NHP expression and/or NHP 

» 

activity that utilize purified preparations of the described 
NHPs and/or NHP product, or cells expressing the same. Such 
compounds can be used as therapeutic agents for the treatment 
15 of any of a wide variety of symptoms associated with 
biological disorders or imbalances. 



4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequences of NHP ORFs 
20 encoding the described NHP amino acid sequences. SEQ ID 
NO: 13 shows a NHP ORF and flanking regions. 

5. DETAILED DESCRIPTION OF THE INVENTION 
The NHPs described for the first time herein are novel 
25 proteins that may be expressed in, inter alia, human cell 

lines, pituitary, lymph node, kidney, testis, thyroid, heart, 
placenta, adipose, placenta, trachea, \ambilical vein 
endothelium, fetal brain, and fetal kidney cells. 

The present invention encompasses the nucleotides 
30 presented in the Sequence Listing, host cells expressing such 
nucleotides, the expression products of such nucleotides, 
and: (a) nucleotides that encode mammalian homologs of the 
described polynucleotide sequences, including the 
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specifically described NHPs, and the NHP products; 

(b) nucleotides that encode one or more portions of the NHPs 
that correspond to functional domains, and the polypeptide 
products specified by such nucleotide sequences, including 

5 but not limited to the novel regions of any active domain (s); 

(c) isolated nucleotides that encode mutant versions, 
engineered or naturally occurring, of the described NHPs in 
which all or a part of at least one domain is deleted or 
altered, and the polypeptide products specified by such 

10 nucleotide sequences, including but not limited to soluble 

proteins and peptides in which all or a portion of the signal 
(or hydrophobic transmembrane) sequence is deleted; (d) 
nucleotides that encode chimeric fusion proteins containing 
all or a portion of a coding region of an NHP, or one of its 

15 domains (e.g., a receptor or ligand binding domain, accessory 
protein/ self -association domain, etc.) fused to another 
peptide or polypeptide; or (e) therapeutic or diagnostic 
derivatives of the described polynucleotides such as 
oligonucleotides, antisense polynucleotides, ribozymes, 

20 dsRNA, or gene therapy constructs comprising a sequence first 
disclosed in the Sequence Listing. 

As discussed above, the present invention includes: 
(a) the human DNA sequences presented in the Sequence Listing 
(and vectors comprising the same) and additionally 

25 contemplates any nucleotide sequence encoding a contiguous 
NHP open reading frame (ORF) that hybridizes to a complement 
of a DNA sequence presented in the Sequence Listing under 
highly stringent conditions, e.g., hybridization to filter- 
bound DNA in 0.5 M NaHPO^, 7% sodium dodecyl sulfate (SDS) , 

30 1 mM EDTA at 65°C, and washing in O.lxSSC/0.1% SDS at 68°C 
(Ausubel P.M. et al., eds., 1989, Current Protocols in 
Molecular Biology, Vol. I, Green Publishing Associates, Inc., 
and John Wiley & sons. Inc., New York, at p. 2.10.3) and 
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encodes a functionally ecjuivalent gene product. Additionally 
contemplated are any nucleotide sequences that hybridize to 
the complement of a DNA sequence that encodes and expresses 
an amino acid sequence presented in the Sequence Listing 
5 under moderately stringent conditions, e.g., washing in 

0.2xSSC/0.1% SDS at 42<>C (Ausubel et al., 1989, supra), yet 
still encodes a functionally equivalent NHP product. 
Functional equivalents of a NHP include naturally occurring 
NHPs present in other species and mutant NHPs whether 

10 naturally occurring or engineered (by site directed 

imitagenesis, gene shuffling, directed evolution as described 
in, for example, U.S* Patent Nos. 5,837,458 and 5,723,323 
both of which are herein incorporated by reference in their 
entirety) . The invention also includes degenerate nucleic 

15 acid variants of the disclosed NHP polynucleotide sequences. 

Additionally contemplated are polynucleotides encoding 
NHP ORFs, or their functional equivalents, encoded by 
polynucleotide sequences that are about 99, 95, 90, or about 
85 percent similar or identical to corresponding regions of 

20 the nucleotide sequences of the Sequence Listing (as measured 
by BLAST sequence comparison analysis using, for example, the 
GCG sequence analysis package using standard default 
settings) . 

The invention also includes nucleic acid molecules, 
25 preferably DNA molecules, that hybridize to, and are 
therefore the complements of, the described NHP gene 
nucleotide sequences. Such hybridization conditions may be 
highly stringent or less highly stringent, as described 
above. In instances where the nucleic acid molecules are 
30 deoxyoligonucleotides ("DNA oligos"), such molecules are 
generally about 16 to about 100 bases long, or about 20 to 
about 80, or about 34 to about 45 bases long, or any 
variation or combination of sizes represented therein that 
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incorporate a contiguous region of sequence first disclosed 
in the Sequence Listing. Such oligonucleotides can be used 
in conjunction with the polymerase chain reaction (PGR) to 
screen libraries, isolate clones, and prepare cloning and 
5 sequencing templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing 
gene expression patterns (particularly using a micro array or 
high- throughput ""chip" format) . Additionally, a series of 

10 the described NHP oligonucleotide sequences, or the 

complements thereof, can be used to represent all or a 
portion of the described NHP sequences. An oligonucleotide 
or polynucleotide sequence first disclosed in at least a 
portion of one or more of the sequences, of SEQ ID NOS: 1-13 

15 can be used as a hybridization probe in conjiinction with a 
solid support matrix/ substrate (resins, beads, membranes, 
plastics, polymers, metal or metallized siibstrates, 
crystalline or polycrystalline substrates, etc.). Of 
particular note are spatially addressable arrays (i.e., gene 

20 chips, microtiter plates, etc.) of oligonucleotides and 
polynucleotides, or corresponding oligopeptides and 
polypeptides, wherein at least one of the biopolymers present 
on the spatially addressable array comprises an 
oligonucleotide or polynucleotide sequence first disclosed in 

25 at least one of the sequences of SEQ ID NOS: 1-13, or an 

amino acid sequence encoded thereby. Methods for attaching 
biopolymers to, or synthesizing biopolymers on, solid support 
matrices, and conducting binding studies thereon are 
disclosed in, inter alia, U.S. Patent Nos. 5,700,637, 

30 5,556,752, 5,744,305, 4,631,211, 5,445,934, 5,252,743, 

4,713,326, 5,424,186, and 4,689,405 the disclosures of which 
are herein incorporated by reference in their entirety. 
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Addressable arrays comprising sequences first disclosed 
in SEQ ID N0S:1-13 can be used to identify and characterize 
the tenrporal and tissue specific expression of a gene. These 
addressable arrays incorporate oligonucleotide sequences of 
5 sufficient length to confer the required specificity,, yet be. 
within the limitations of the production technology. The 
length of these probes is within a range of between about 8 
to about 2000 nucleotides. Preferably the probes consist of 
60 nucleotides and more preferably 25 nucleotides from the 

10 sequences first disclosed in SEQ ID N0S:1-13- 

For example, a series of the described oligonucleotide 
sequences, or the complements thereof, can be used in chip 
format to represent all or a portion of the described 
sequences. The oligonucleotides, typically between about 16 

15 to about 40 (or any whole number within the stated range) 

nucleotides in length can partially overlap each other and/or 
the sequence may be represented using oligonucleotides that 
do not overlap. Accordingly, the described polynucleotide 
sequences shall typically comprise at least about two or 

20 three distinct oligonucleotide sequences of at least about 8 
nucleotides in length that are each first disclosed in the 
described Sequence Listing. Such oligonucleotide sequences 
can begin at any nucleotide present within a sequence in the 
Sequence Listing and proceed in either a sense (5'-to-3') 

25 orientation vis-a-vis the described sequence or in an 
antisense orientation. 

Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of 
gene functions and generating novel and xanexpected insight 

30 into transcriptional processes and biological mechanisms. The 
use of addressable arrays comprising sequences first 
disclosed in SEQ ID N0S:1-13 provides detailed information 
about transcriptional changes involved in a specific pathway, 

7 
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potentially leading to the identification of novel conponents 
or gene fxinctions that manifest themselves as novel 
phenotypes - 

Probes consisting of sequences first disclosed in SEQ 
5 ID N0S:1-13 can also be used in the identification, selection 
and validation of novel molecular targets for drug discovery. 
The use of these unique sequences permits the direct 
confirmation of drug targets and recognition of drug 
dependent changes in gene expression that are modulated 

10 through pathways distinct from the drugs intended target. 
These unique sequences therefore also have utility in 
defining and monitoring both drug action and toxicity. 

As an example of utility, the sequences first disclosed 
in SEQ ID N0S:1-13 can be utilized in microarrays or other 

15 assay formats, to screen collections of genetic material from 
patients who have a particular medical condition. These 
investigations can also be carried out using the sequences 
first disclosed in SEQ ID N0S:1-13 in silico and by coinparing 
previously collected genetic databases and the disclosed 

20 sequences using computer software known to those in the art. 

Thus the sequences first disclosed in SEQ ID N0S:i-13 
can be used to identify mutations associated with a 
particular disease and also as a diagnostic or prognostic 
assay. 

25 Although the presently described sequences have been 

specifically described using nucleotide sequence, it should 
be appreciated that each of the sequences can uniquely be 
described using any of a wide variety of additional 
structural attributes, or combinations thereof. For example, 

30 a given sequence can be described by the net composition of 
the nucleotides present within a given region of the sequence 
in conjunction with the presence of one or more specific 
oligonucleotide sequence (s) first disclosed in the SEQ ID 
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NOS: 1-13. Alternatively, a restriction map specifying the 
relative positions of restriction endonuclease digestion 
sites, or various palindromic or other specific 
oligonucleotide sequences can be used to structiirally 
5 describe a given sequence. Such restriction maps, which are 
typically generated by widely available computer programs 
{e.g., the University of Wisconsin GCG sequence analysis 
package, SEQUENCHER 3.0, Gene Codes Corp., Ann Arbor, MI, 
etc . ) , can optionally be used in conjunction with one or more 

10 discrete nucleotide sequence(s) present in the sequence that 
can be described by the relative position of the sequence 
relatve to one or more additional sequence (s) or one or more 
restriction sites present in the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions 

15 may refer, e.g., to washing in 6xSSC/0.05% sodium 

pyrophosphate at 37^C (for 14-base oligos) , 48®C (for 17-base 
oligos) , 55^0 (for 20-base oligos), and 60^0 (for 23-base 
oligos) . These nucleic acid molecules may encode or act as 
NHP gene antisense molecules, useful, for example, in NHP 

20 gene regulation (for and/or as antisense primers in 

amplification reactions of NHP gene nucleic acid sequences) . 
With respect to NHP gene regulation, such techniques can be 
used to recrulate biological functions- Further, such 
sequences may be used as part of ribozyme and/or triple helix 

25 sequences that are also useful for NHP gene regulation . 
Inhibitory antisense or double stranded 
oligonucleotides can additionally comprise at least one 
modified base moiety which is selected from the group 
including but not limited to 5-f luorouracil, 5-bromouracil, 

30 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 

4- acetylcytosine , 5 - { carboxyhydroxylmethyl ) uracil , 

5- carboxymethylaminomethyl-2 - thi our i dine , 
5-carboxymethylaminomethyluracil , dihydrouracil , beta-D- 
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galactosylqueosinei inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2 , 2 -dimethyl guanine, 

2- inethyladenine, 2-inethylguanine, 3 -methyl cytosine, 
5-methylcytosine, N6-adenine, 7-methylguanine, 

5 5-methylaminamethyluracil , 5-methoxyaminomethyl-2-thiouracil , 
beta-D-maimosylqueosine , 5 '-methoxycarboxyme thyluracil , 
5-methoxyuracil , 2-methylthio-N6-isopentenyladenine , 
uracil-5-oxyacetic acid (v) , wybutoxosine, pseudouracil, 
queos ine , 2 - thiocy t os ine , 5 -methyl - 2 - thiouraci 1 , 

10 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 

5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v) , 
5-methyl-2-thiouracil , 3- ( 3 -amino-3-N-2-carboxypropyl) 
uracil, (acp3)w, and 2, 6-diaminopurine. 

The antisense oligonucleotide can also comprise at 

15 least one modified sugar moiety selected from the group 
including but not limited to arabinose, 2-f luoroarabinose, 
xylulose, and hexose. 

In yet another embodiment, the antisense 
oligonucleotide will comprise at least one modified phosphate 

20 backbone . selected from the group consisting of a 
phosphorothioate, a phosphorodithioate, a 

phosphoramidothioate, a phosphoramidate, a phosphordiamidate, 
a methylphosphonate, an alkyl phosphotriester, and a 
formacetal or analog thereof. 

25 In yet another embodiment, the antisense 

oligonucleotide is an a-anomeric oligonucleotide. An a- 
anomeric oligonucleotide forms specific doxible-stranded 
hybrids with complementary RNA in which, contrary to the 
usual 3-units, the strands run parallel to each other 

30 (Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641). The 
oligonucleotide is a 2'-0-methylribonucleotide (Inoue et al., 
1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
analogue (Inoue efc al., 1987, FEBS Lett. 215:327-330). 
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Alternatively, doiible stranded RNA can be used to disrupt the 
expression and function of a targeted NHP. 

Oligonucleotides of the invention can be synthesized by 
standard methods known in the art, e.g, by use of an 
5 automated DNA synthesizer {such as are coitimercially available 
from Biosearch, Applied Biosystems, etc). As examples, 
phosphorothioate oligonucleotides can be synthesized by the 
method of Stein at ai. (1988, Nucl. Acids Res. 15:3209), and 
methylphosphonate oligonucleotides can be prepared by use of 

10 controlled pore glass polymer supports (Sarin et al., 1988, 
Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc. 

Low stringency conditions are well known to those of 
skill in the art, and will vary predictably depending on the 
specific organisms from which the library and the labeled 

15 sequences are derived. For guidance regarding such 

conditions see, for example, Sambrook et al., 1989, Molecular 
Cloning, A Laboratory Manual (and periodic updates thereof) , 
Cold Springs Harbor Press, N.Y.; and Ausubel et al., 1989, 
Current Protocols in Molecular Biology, Green Publishing 

20 Associates and Wiley Interscience, N.Y. 

Alternatively, suitably labeled NHP nucleotide probes 
can be used to screen a hiaman genomic library using 
appropriately stringent conditions or by PCR. The 
identification and characterization of h\iman genomic clones 

25 is helpful for identifying polymorphisms (including, but not 
limited to, nucleotide repeats, microsatellite alleles, 
single nucleotide polymorphisms, or coding single nucleotide 
polymorphisms), determining the genomic structure of a given 
locus/allele, and designing diagnostic tests. For example, 

30 sequences derived from regions adjacent to the intron/exon 
boundaries of the human gene can be used to design primers 
for use in amplification assays to detect mutations within 
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the exoxis, introns, splice sites (e.g., splice acceptor 
and/or donor sites) / etc., that can be used in diagnostics 
and pharmacogenomics . 

Further, a NHP gene homolog can be isolated from 
5 nucleic acid from an organism of interest by performing PGR 
using two degenerate or "Vobble" oligonucleotide primer pools 
designed on the basis of amino acid sequences within the NHP 
products disclosed herein. The template for the reaction may 
be total RNA, mRNA, and/or cDNA obtained by reverse 

10 transcription of mRNA prepared from human or non-human cell 
lines or tissue known or suspected to express an allele of a 
NHP gene. The PGR product can be siibcloned and sequenced to 
ensure that the amplified sequences represent the sequence of 
the desired NHP gene. The PGR fragment can then be used to 

15 isolate a full length cDNA clone by a variety of methods : 
For example, the amplified fragment can be labeled and used 
to screen a cDNA library, such as a bacteriophage cDNA 
library. Alternatively, the labeled fragment can be lised to 
isolate genomic clones via the screening of a genomic 

20 library. 

PGR technology can also be used to isolate full length 
CDNA sec[uences- For example, RNA can be isolated, following 
standard procedures, from an appropriate cellular or tissue 
source (i.e., one known, or suspected, to express a NHP 

25 gene) . A reverse transcription (RT) reaction can be 

performed on the RNA using an oligonucleotide primer specific 
for the most 5* end of the amplified fragment for the priming 
of first strand synthesis. The resulting RNA/DNA hybrid may 
then be "tailed" using a standard terminal transferase 

30 reaction, the hybrid may be digested with RNase H, and second 
strand synthesis may then be primed with a complementary 
primer. Thus, cDNA sequences upstream of the amplified 

12 
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fragment can be isolated. For a review of cloning strategies 
that can be used, see e.g., Sambrook et al., 1989, supra. 

A cDNA encoding a mutant NHP gene can be isolated, for 
exanple, by using PGR. In this case, the first cDNA strand 
5 may be synthesized by hybridizing an oligo-dT oligonucleotide 
to mRNA isolated from tissue known or suspected to be 
expressed in an individual putatively carrying a mutant NHP 
allele, and by extending the new strand with reverse 
transcriptase* The second strand of the cDNA is then 

10 synthesized using an oligonucleotide that hybridizes 

specifically to the 5' end of the normal gene. Using these 
two primers, the product is then amplified via PGR, 
optionally cloned into a suitable vector, and s\ibjected to 
DNA sequence analysis through methods well known to those of 

15 skill in the art. By comparing the DNA sequence of the 
mutant NHP allele to that of a corresponding normal NHP 
allele, the imitation (s) responsible for the loss or 
alteration of function of the mutant NHP gene product can be 
ascertained. 

20 Alternatively, a genomic library can be constructed 

using DNA obtained from an individual suspected of or known 
to carry a mutant NHP allele (e.g., a person manifesting a 
NHP-associated phenotype such as, for example, obesity, high 
blood pressure, connective tissue disorders, infertility, 

25 etc . ) , or a cDNA library can be constructed using RNA from a 
tissue known, or suspected, to esq^ress a mutant NHP allele. 
A normal NHP gene, or any suitable fragment thereof, can then 
be labeled and used as a probe to identify the corresponding 
mutant NHP allele in such libraries. Clones containing 

30 mutant NHP gene sequences can then be purified and subjected 
to sequence analysis according to methods well known to those 
skilled in the art. 
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Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated 
from a tissue known, or suspected, to express a mutant NHP 
allele in an individual suspected of or known to carry such a 
5 mutant allele. In this manner, gene products made by the 
putatively mutant tissue can be expressed and screened using 
standard antibody screening techniques in conjimction with 
antibodies raised against a normal NHP product, as described 
below. (For screening techniques, see, for example, Harlow, 

10 E. and Lane, eds., 1988, "Antibodies: A Laboratory Manual " , 
Cold Spring Harbor Press, Cold Spring Harbor, NY.) 

Additionally, screening can be accomplished by 
screening with labeled NHP fusion proteins, such as, for 
example, alkaline phosphatase-NHP or NHP-alkaline phosphatase 

15 fusion proteins. In cases where a NHP mutation results in an 
expressed gene product with altered fimction {e.g., as a 
result of a missense or a frameshif t mutation) , polyclonal 
antibodies to a NHP are likely to cross-react with a 
corresponding mutant NHP gene product. Library clones 

20 detected via their reaction with such labeled antibodies can 
be purified and subjected to sequence analysis according to 
methods well known in the art. 

The invention also encompasses (a) DNA vectors that 
contain any of the foregoing NHP coding sequences and/or 

25 their complements (i.e., antisense) ; (b) DNA expression 
vectors that contain any of the foregoing NHP coding 
sequences operatively associated with a regulatory element 
that directs the expression of the coding sequences (for 
example, baculo virus as described in U.S. Patent No. 

30 5,869,336 herein incorporated by reference); (c) genetically 
engineered host cells that contain any of the foregoing NHP 
coding sequences operatively associated with a regulatory 
element that directs the expression of the coding sequences 

14 
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in the host cell; and (d) genetically engineered host cells 
that express an endogenous NHP gene iinder the control of an 
exogenously introduced regulatory element (i.e., gene 
activation) . As used herein, regulatory elements include, 
5 but are not limited to, inducible and non- inducible 

promoters, enhancers, operators and other elements known to 
those skilled in the art that drive and regulate expression. 
Such regulatory elements include but are not limited to the 
cytomegalovirus (hCMV) immediate early gene, regulatable, 

10 viral elements (particularly retroviral LTR promoters) , the 
early or late promoters of SV40 adenovirus, the lac system, 
the trp system, the TAC system, the TRC system, the major 
operator and promoter regions of phage lambda, the control 
regions of fd coat protein > the promoter for 

15 3-phosphoglycerate kinase (PGK) , the promoters of acid 

phosphatase, and the promoters of the yeast a-mating factors. 

The present invention also encompasses antibodies and 
anti-idiotypic antibodies (including Fab fragments), 
antagonists and agonists of the NHP, as well as compounds or 

20 nucleotide constructs that inhibit expression of a NHP gene 
(transcription factor inhibitors, antisense and ribozyme 
molecules, of gene or regulatory sequence replacement 
constructs), or promote the expression of a NHP (e.g., 
esqpression constructs in which NHP coding sequences are 

25 operatively associated with expression control elements such 
as promoters, promoter /enhancers, etc.). 

The NHPs or NHP peptides, NHP fusion proteins, NHP 
nucleotide sequences, antibodies, antagonists and agonists 
can be useful for the detection of mutant NHPs or 

30 inappropriately expressed NHPs for the diagnosis of disease. 
The NHP proteins or peptides, NHP fusion proteins, NHP 
nucleotide sequences, host cell expression systems, 
antibodies, antagonists, agonists and genetically engineered 

15 
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cells and animals can be used for screening for drugs (or 
high throughput screening of combinatorial libraries) 
effective in the treatment of the symptomatic or phenotypic 
manifestations of perturbing the normal function of NHP in 
5 the body. The use of engineered host cells and/or animals 
may offer .an advantage in that such systems allow not only 
for the identification of compounds that bind to the 
endogenous receptor for an NHP, but can also identify 
compounds that trigger NHP-mediated activities or pathways. 

10 Finally, the NHP products can be used as therapeutics - 

For example, soliible derivatives such as NHP peptides /domains 
corresponding to NHPs, NHP fusion protein products 
(especially NHP-Ig fusion proteins, i.e., fusions of a NHP, 
or a domain of a NHP, to an IgFc) , NHP antibodies and anti- 

15 idiotypic antibodies (including Fab fragments) , antagonists 
or agonists (including conpounds that modulate or act on 
downstream targets in a NHP-mediated pathway) can be used to 
directly treat diseases or disorders. For instance, the 
administration of an effective amount of soluble NHP, or a 

20 NHP-IgFc fusion protein or an anti-idiotypic antibody (or its 
Fab) that mimics the NHP could activate or effectively 
antagonize the endogenous NHP receptor. Nucleotide 
constructs encoding such NHP products can be used to 
genetically engineer host cells to escpress such products in 

25 vivo; these genetically engineered cells fimction as 

"bioreactors" in the body delivering a continuous supply of a 
NHP, a NHP peptide, or a NHP fusion protein to the body. 
Nucleotide constructs encoding functional NHPs, mutant NHPs, 
as well as antisense and ribozyme molecules can also be used 

30 in "gene therapy" approaches for the modulation of NHP 
expression. Thus, the invention also encompasses 
pharmaceutical formulations and methods for treating 
biological disorders. 
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Various aspects of the invention are described in 
• greater detail in the subsections below. 

5.1 THE ISTHP SEQUENCES 
5 The cDNA sequences and the corresponding deduced amino 

acid sequences of the described NHPs are presented in the 
Sec[uence Listing. The NHP nucleotides were obtained from 
clustered human ESTs, htaman thyroid RACE products, and cDNAs 
from kidney and thyroid libraries (Edge Biosystems, 

10 Gaithersburg, MD) . The described NHPs are similar to 
eucaryotic sulfate transporters and hxaman pendrin. A 
particularly similar sulfate transporter has been associated 
with diastrophic dysplasia which has been associated with 
dwarfism, inherited chondrodysplasia, and ,osseous dysplasia 

15 I. Accordingly, the described NHPs can be useful in 
detecting and treating such conditions. 

A polymorphism was detected consisting of an A-T 
transversion at the sequence region represented by, for 
example, nucleotide nvuriber 1454 of SEQ ID N0:1 which can 

20 result in a corresponding E or V being present at amino acid 
position 485 of, for example, SEQ ID N0:2. 

Trcinsporters and transporter related multidrug 
resistance (MDR) seciuences, as well as uses and applications 
that are geimane to the described NHPs, are described in U.S. 

25 Patents Nos. 5,198,344 and 5,866,699 which are herein 
incorporated by reference in their entirety. 

An additional application of the described novel human 
polynucleotide sequences is their use in the molecular 
mutagenesis/evolution of proteins that are at least partially 

30 encoded by the described novel sequences using, for example, 
polynucleotide shuffling or related methodologies. Such 
approaches are described in U.S. Patents Nos. 5,830,721 and 



17 



wo 0im583 



PCT/USOl/18393 



5,837,458 which are herein incorporated by reference in their 
entirety- 

NHP gene products can also be expressed in transgenic 
animals. Animals of any species, including, but not limited 
5 to, worms, mice, rats, rabbits, guinea pigs, pigs, micro- 
pigs, birds, goats, and non-human primates, e.g., baboons, 
monkeys, and chimpanzees may be used to* generate NHP 
transgenic animals. 

Any technique known in the art may be used .to introduce 

10 a NHP transgene into animals to produce the foxinder lines of 
transgenic animals. Such techniques include, but are not 
limited to pronuclear microinjection (Hoppe, P.C. and Wagner, 
T.E., 1989, U.S. Pat. No. 4,873,191); retrovirus mediated 
gene transfer into germ lines (Van der Putten efc al., 1985, 

15 Proc. Natl. Acad. Sci., USA »2: 6148--6152) ; gene targeting in 
embryonic stem cells (Thompson et al., 1989, Cell 55:313- 
321); electroporation of * embryos (Lo, 1983, Mol Cell. Biol. 
3:1803-1814); and sperm-mediated gene transfer (Lavitrano et 
al., 1989, Cell 57:717-723); etc. For a review of such 

20 techniques, see Gordon, 1989, Transgenic Animals, Intl. Rev. 
Cytol. 115:171-229, which is incorporated by reference herein 
in its entirety. 

The present invention provides for transgenic animals 
that carry the NHP transgene in all their cells, as well as 

25 animals which carry the transgene in some, but not all their 
cells, i.e., mosaic animals or somatic cell transgenic 
animals- The transgene may be integrated as a single 
transgene or in conca tamers, e.g., head- to-head tandems or 
head-to-tail tandems. The transgene may also be selectively 

30 introduced into and activated in a particular cell type by 
following, for example, the teaching of Lasko et al., 1992, 
Proc. Natl. Acad. Sci, USA 85:6232-6236.* The regulatory 
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sequences required for such a cell-type specific activation 
will depend upon the particular cell type of interest, and 
will be apparent to those of skill in the art. 

When it is desired that a NHP transgene be integrated 
5 into the chromosomal site of the endogenous NHP gene, gene 
targeting is preferred. Briefly, when such a technique is to 
be utilized, vectors containing some nucleotide sequences 
homologous to the endogenous NHP gene are designed for the 
purpose of integrating, via homologous recombination with 

10 chromosomal sequences, into and disrupting the function of 
the nucleotide sequence of the endogenous NHP gene (i.e., 
^knockouf animals) . 

The transgene can also be selectively introduced into a 
particular cell type, thus inactivating the endogenous NHP 

15 gene in only that cell type, by following, for exanple, the 
teaching of Gu et al., 1994, Science, 255:103-106. The 
regulatory sequences required for such a cell-type specific 
inactivation will depend upon the particular cell type of 
interest, and will be apparent to those of skill in the art. 

20 Once transgenic animals have been generated, the 

expression of the recoinbinant NHP gene may be assayed 
utilizing standard techniques. Initial screening may be 
accomplished by Southern blot analysis or PGR techniques to 
analyze animal tissues to assay whether integration of the 

25 transgene has taken place. The level of mHNA expression of 
the transgene in the tissues of the transgenic animals may 
also be assessed using techniques which include but are not 
limited to Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and RT-PCR. 

30 Samples of NHP gene-expressing tissue, may also be evaluated 
immunocytochemically using antibodies specific for the NHP 
transgene product. 
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5.2 NHPS AND NHP POLYPEPTIDES 
NHPs, polypeptides, peptide fragments, imitated, 
trrmcated, or deleted forms of the NHPs, and/or NHP fusion 
proteins can be prepared for a variety of uses. These uses 
5 include but are not limited to the generation of antibodies, 
as reagents in diagnostic assays, the identification of other 
cellular gene products related to a NHP, as reagents in 
assays for screening for compounds that can be used as 
pharmaceutical reagents useful in the therapeutic treatment 

id of mental, biological, or medical disorders and diseases. 
Given the similarity information and expression data, the 
described NHPs can be targeted (by drugs, oligos, antibodies, 
etc,) in order to treat disease, or to therapeutically 
augment the efficacy of, for example, chemo therapeutic agents 

15 used in the treatment of breast or prostate cancer. 

Hie Sequence Listing discloses the amino acid sequences 
encoded by the described NHP genes. The NHPs typically 
display have initiator methionines in DNA sequence contexts 
consistent with a translation initiation site. 

20 Hie NHP amino acid sequences of the invention include 

the amino acid sequence presented in the Sequence Listing as 
well as analogues and derivatives thereof. Further, 
corresponding NHP homologues from other species are 
encompassed by the invention. In fact, any NHP protein 

25 encoded by the NHP nucleotide sequences described above are 
within the scope of the invention, as are any novel 
polynucleotide sequences encoding all or any novel portion of 
.an amino acid sequence presented in the Sequ^ce Listing. 
The degenerate nature of the genetic code is well known, and, 

30 accordingly, each amino acid presented in the Sequence 
Listing, is generically representative of the well known 
nucleic acid ^triplet* codon, or in many cases codons, that 
can encode the amino acid. As such, as contemplated herein, 

20 
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the amino acid sequences presented in the ^ Sequence Listing, 
when taken together with the genetic code (see, for example. 
Table 4-1 at page 109 of ^Molecular Cell Biology", 1986, J. 
Darnell efc al. eds.. Scientific American Books, New York, NY, 
5 herein incorporated by reference) are generically 
representative of all the various permutations and 
combinations of nucleic acid sequences that can encode such 
amino acid sequences. 

The invention also encompasses proteins that are 

10 functionally equivalent to the NHPs encoded by the presently 
described nucleotide sequences as judged by any of a nxunber 
of criteria, including, but not limited to, the ability to 
bind and cleave a substrate of a NHP, or the ability to 
effect an identical or complementary downstream pathway, or a 

15 change in cellular metabolism [e.g., proteolytic activity, 

ion flux, tyrosine phosphorylation, etc.). Such functionally 
equivalent NHP proteins include, but are not limited to, 
additions or substitutions of amino acid residues within the 
amino acid sequence encoded by the NHP nucleotide sequences 

20 described above, but which result in a silent change, thus 

producing a functionally equivalent gene product. Amino acid 
substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, 
and/or the amphipathic nature of the residues involved. For 

25 example, nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, 
and glutamine; positively charged (basic) amino acids include 

30 arginine, lysine, and histidine; and negatively charged 

(acidic) amino acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used 
to express the NHP nucleotide sequences of the invention. 
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Where, as in the present instance, the NHP peptide or 
polypeptide is thought to be membrane protein, the 
hydrophobic regions of the protein can be excised and the 
resulting soliible peptide or polypeptide can be recovered 
5 from the culture media. Such expression systems also 
encompass engineered host cells that express a NHP, or 
functional equivalent, in situ. .Purification or enrichment 
of a NHP from such expression systems can be accomplished 
using appropriate detergents and lipid micelles and methods 

10 well known to those skilled in the art. However, such 

engineered host cells themselves may be used in situations 
where it is important not only to retain the structural and 
fxuictional characteristics of the NHP, but to assess 
biological activity, e.g., in drug screening assays. 

15 The expression systems that may be used for purposes of 

the invention include but are not limited to microorganisms 
such as bacteria {e.g., E. coli, B. subtilis) transformed 
with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing NHP nucleotide sequences; yeast 

20 (e.g., Saccbaromyces, Picbia) transfoimed with recombinant 

yeast expression vectors containing NHP nucleotide sequences; 
insect cell systems infected with recombinant virus expres- 
sion vectors (e.g., baculovirus) containing NHP sequences; 
plant cell systems infected with recombinant virus expression 

25 vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic 
virus, TMV) or transformed with recombinant plasmid expres- 
sion vectors (e.g., Ti plasmid) containing NHP nucleotide 
sequences; or mammalian cell, systems (e.g. , COS, CHO, BHK, 
293, 3T3) harboring recombinant expression constructs 

30 containing promoters derived from the genome of mammalian 
cells {e.g., n^etallothionein promoter) or from mammalian 
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viruses {e.g., the adenovirus late promoter; the vaccinia 
virus 7.5K promoter). 

In bacterial systems, a number of expression vectors 
may be advantageously selected depending upon the use 
5 intended for the NHP product being expressed. For example, 
when a large quantity of such a protein is to be produced for 
the generation of pharmaceutical compositions of or 
containing NHP, or for raising antibodies to a NHP, vectors 
that direct the expression of high levels of fusion protein 

10 products that are readily purified may be desirable. Such 
vectors include, but are not limited, to the E. coli 
expression vector pUR278 (Ruther et al., 1983, EMBO J. 
2:1791), in which a NHP coding sequence may be ligated 
individually into the vector in frame with the lacZ coding 

15 region so that a fusion protein is produced; pIN vectors 

(Inouye & Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van 
Heeke & Schuster, 1989, J. Biol. Chem. 264:5503-5509); and 
the like. pGEX vectors (Pharmacia or American Type Culture 
Collection) can also be used to express foreign polypeptides"^ 

20 as fusion proteins with glutathione S-transf erase (GST) . In 
general, such fusion proteins are soluble and can easily be 
purified from lysed cells by adsorption to glutathione- 
agarose beads followed by elution in the presence of free 
glutathione. The PGEX vectors are designed to include 

25 thrombin or factor Xa protease cleavage sites so that the 
cloned target gene product can be released from the GST 
moiety. 

In an insect system, Autographa califorziica nuclear 
polyhidrosis virus (AcNPV) is used as a vector to express 
30 foreign genes. The virus grows in Spodoptera frugiperda 

cells. A NHP gene coding sequence may be cloned individually 
into non-essential regions (for example the polyhedrin gene) 
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of the virus and placed under control of an AcNPV promoter 
(for example the polyhedrin promoter) . Successful insertion 
of NHP gene coding sequence will result in inactivation of 
the polyhedrin gene and production of non-occluded 
5 recombinant* virus (i.e., virus lacking the proteinaceous coat 
coded for by the polyhedrin gene) . These recombinant viruses 
are then used to infect Spodoptera frugiperda cells in which 
the inserted gene is expressed {e.g., see Smith et al., 
1983, J. Virol. 45:584; Smith, U.S. Patent No. 4,215,051). 

10 In mairanalian host cells, a number of viral-based 

expression systCTis may be utilized. In cases where an 
adenovirus is used as an esqpression vector, the NHP 
nucleotide sequence of interest may be ligated to an 
adenovirus transcription/ translation control complex, e.gr. , 

15 the late promoter and tripartite leader sequence. This 

chimeric gene may then be inserted in the adenovirus genome 
by in vitro or in vivo recombination. Insertion in a non- 
essential region of the viral genome (e.g., region El or E3) 
will result in a recombinant virus that is viable and capable 

20 of expressing a NHP product in infected hosts (e.g.. See 
Logan & Shenk, 1984, Proc. Natl. Acad. Sci. USA fll:3655- 
3659) . Specific initiation signals may also be required for 
efficient translation of inserted NHP nucleotide sequences. 
These signals include the ATG initiation codon and adjacent 

25 sequences. In cases where an entire NHP gene or cDNA, 

including its own initiation codon and adjacent sequences, is 
inserted into the appropriate expression vector, no 
additional translational control signals may be needed. 
However, in cases where only a portion of a NHP coding 

30 sequence is inserted, exogenous translational control 

signals, including, perhaps, the ATG initiation codon, must 
be provided. Furthermore, the initiation codon itoist be in 

24 
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phase with the reading frame of the desired coding sequence 
to ensure translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of 
a variety of origins, both natural and synthetic. The 
5 efficiency of expression may be enhanced by the inclusion of 
appropriate transcription enhancer elements, transcription 
terminators, etc. (See Bitter efc al., 1987, Methods in 
Enzymol- 153:516-544). 

In addition, a host cell strain may be chosen that 

10 modulates the expression of the inserted sequences, or 
modifies and processes the gene product in the specific 
fashion desired. Such modifications (e.g., glycosylation) 
and processing (e.g., cleavage) of protein products may be 
important for the fimction of the protein. Different host 

15 cells have characteristic and specific mechanisms for the 
post-translational processing and modification of proteins 
and gene products. Appropriate cell lines or host systems 
caii be chosen to ensure the correct modification and 
processing of the foreign protein expressed. To this end, 

20 eukaryotic host cells which possess the cellular machineiry 
for proper processing of the primary transcript, 
glycosylation, and phosphorylation of the gene product may be 
used. Such mammalian host cells include, but are not limited 
to, CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in 

25 particular, himtan cell lines . 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For exanple, cell 
lines which stably express the NHP sequences described above 
can be engineered. Rather than using expression vectors 

30 which contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, 
transcription terminators, polyadenylation sites, etc.), and 
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a selectable marker. Following the introduction of the 
foreign DNA, engineered cells may be allowed to grow for 1-2 
days in an enriched media, and then are switched to a 
selective media. The selectable marker in the recombinant 
5 plasmid confers resistance to the selection and allows cells 
to stably integrate the plasmid into their chromosomes and 
grow to form foci which in turn can be cloned and e3q>anded 
into cell lines. This method may advantageously be used to 
engineer cell lines which express the NHP product. Such 

10 engineered cell lines may be particularly useful in screening 
and evaluation of compounds that affect the endogenous 
activity of the NHP product. 

A number of selection systems may be used, including 
but not limited to the herpes simplex virus thymidine kinase 

15 (Wigler, efc al., 1977, Cell 11:223), hypoxanthine-guanine 

phosphoribosyl transferase (Szybalska & Szybalski, 1962, Proc. 
Natl. Acad. Sci. USA 48:2026), and adenine 

phosphoribosyl transferase (Lowy, et al., 1980, Cell 22:817) 
genes can be employed in tk", hgprt" or aprt" cells, 

20 respectively- Also, antimetabolite resistance can be used as 
the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler, et al., 1980, 
Natl, Acad. Sci. USA 77:3567; O'Hare, et al., 1981, Proc. 
Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance 

25 to mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. 

Acad. Sci. USA 78:2072); neo, which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J, Mol. 
Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et al., 1984, Gene 30:147). 

30 Alternatively, any fusion protein can be readily 

purified by utilizing an antibody specific for the fusion 
protein being expressed. For example, a system described by 



26 



wo 01/94583 PCT/USOl/18393 

Janknecht et al. allows for the ready purification of non- 
denatured fusion proteins expressed in human cell lines 
(Janknecht, et al., 1991, Proc. Natl, Acad, Sci. USA 88 z 8912- 
8976) . In this system, the gene of interest is subcloned 
5 into a vaccinia recombination plasmid such that the gene's 
open reading frame is translationally fused to an amino- 
terminal tag consisting of six histidine residues. Extracts 
from cells infected with recombinant vaccinia virus are 
loaded onto Ni^^-nitriloacetic acid-agarose columns and 

10 histidine- tagged proteins are selectively eluted with 
imidazole-containing buffers . 

Also encompassed by the present invention are fusion 
proteins that direct the NHP to a target organ and/or 
facilitate transport across the menibrane into the cytosol. 

15 Conjugation of NHPs to antibody molecules or their Fab 

fragments could be used to target cells bearing a particular 
epitope. Attaching the appropriate signal sequence to the 
NHP would also transport the NHP to the desired location 
within the cell. Alternatively targeting of NHP or its 

20 nucleic acid sequence might be achieved using liposome or 

lipid complex based delivery systems. Such technologies are 
described in *Liposomes:A Practical Approach*, New,RRC ed., 
Oxford University Press, New York and in U.S. Patents Nos. 
4,594,595, 5,459,127, 5,948,767 and 6,110,490 and their 

25 respective disclosures which are herein incorporated by 

reference in their entirety. Additionally eiribodied are novel 
protein constructs engineered in such a way that they 
facilitate transport of the NHP to the target site or desired 
organ. This goal may be achieved by coupling of the NHP to a 

30 cytokine or other ligand that provides targeting specificity, 
and/or to a protein transducing domain (see generally U.S. 
applications Ser. No. 60/111,701 and 60/056,713, both of 
which are herein incorporated by reference, for examples of 
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such transducing sequences) to facilitate passage across 
cellular membranes if needed and can optionally be engineered 
to include nuclear localization sequences when desired. 

5 5.3 ANTIBODIES TO NHP PRODUCTS 

Antibodies that specifically recognize one or more 
epitopes of a NHP, or epitopes of conserved variants of a 
NHP, or peptide fragments of a NHP are also encompassed by 
the invention. Such antibodies include but are not limited 

10 to polyclonal antibodies, monoclonal antibodies (mAbs), 

humanized or chimeric antibodies, single chain antibodies, 
Fab fragments, F(ab')2 fragments, fragments produced by a Fab 
esqpression library, anti-idiotypic (anti-Id) antibodies, and 
epi tope-binding fragments of any of the above. 

15 The antibodies of the invention may be used, for 

exanple, in the detection of NHP in a biological sample and 
may, therefore, be utilized as part of a diagnostic or 
prognostic technique whereby patients may be tested for 
abnormal amoiints of NHP. Such antibodies may also be 

20 utilized in conjunction with, for example, compound screening 
schemes for the evaluation of the effect of test compounds on 
expression and/or activity of a NHP gene product. 
Additionally, such antibodies can be used in conjxmction gene 
therapy to, for example, evaluate the normal and/or 

25 engineered NHP-esqpressing cells prior to their introduction 
into the patient. Such antibodies may additionally be used 
as a method for the inhibition of abnormal NHP activity. 
Thus, such antibodies may, therefore, be utilized as part of 
treatment methods. 

30 For the production of antibodies, various host animals 

may be immunized by injection with a NHP, an NHP peptide 
{e.g., one corresponding to a functional domain of an NHP), 
truncated NHP polypeptides (NHP in which one or more domains 
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have been deleted) , functional equivalents of the NHP or 
mutated variant of the NHP. Such host animals may include 
but are not limited to pigs, rabbits, mice, goats, and rats, 
to name but a few. Various adjuvants may be used to increase 
5 the immunological response, depending on the host species, 
including but not limited to Freund's adjuvant (complete and 
incomplete) , mineral salts such as aluminum hydroxide or 
al\iminum phosphate, surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil 

10 emulsions, and potentially useful hxnnan adjuvants such as BCG 
(bacille Calmette-Guerin) and Corynebacterium parvum. 
Alternatively, the immune response could be enhanced by 
combination and or coupling with molecules such as keyhole 
limpet hemocyanin, tetanus toxoid, diptheria toxoid, 

15 ovalbtamin, cholera toxin or fragments thereof. Polyclonal 
antibodies are heterogeneous populations of antibody 
molecules derived from the sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous 
populations of antibodies to a particular antigen, can be 

20 obtained by any technique which provides for the production 
of antibody molecules by continuous cell lines in culture. 
Hiese include, but are not limited to, the hybridoma 
technique of Kohler and Milstein, (1975, Nature 25ff: 495-497; 
and U.S. Patent No. 4,376,110)/ the hioman B-cell hybridoma 

25 technique (Kosbor efc al., 1983, Immunology Today 4:72; Cole 
et al., 1983, Proc. Natl. Acad. Sci. USA 80:2026-2030), and 
the EBV-hybridoma technique (Cole et al., 1985, Monoclonal 
Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77- 
96) . Such antibodies may be of any imm\inoglobulin class 

30 including IgG, IgM, IgE, IgA, IgD and any svibclass thereof. 
The hybridoma producing the mAb of this invention may be 
cultivated in vitro or in vivo. Production of high titers of 
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mAbs in vivo makes this the presently preferred method of 
production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al., 1984, Proc. Natl. 
5 Acad- Sci., 81:6851-6855; Neuberger et al., 1984, Nature, 
312:604-608; Takeda et al., 1985, Nature, 314:452-454) by 
splicing the genes from a mouse antibody molecule of 
appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity 

10 can be used. A chimeric antibody is a molecule in which 

different portions are derived from different animal species, 
such as those having a variable region derived from a murine 
mAb and a human immunoglobulin constant region. Such 
technologies are described in U.S. Patents Nos. 6,075,181 and 

15 5,877,397 and their respective disclosures which are herein 
incorporated by reference in their entirety. Also 
encompassed by the present invention is the use of fully 
. hiamanized monoclonal antibodies as described in US Patent No. 
6,150,584 and respective disclosures which are herein 

20 incorporated by reference in their entirety. 

Alternatively, techniques described for the production 
of single chain antibodies (U.S. Patent 4,946,778; Bird, 
1988, Science 242:423-426; Huston et al., 1988, Proc. Natl. 
Acad. Sci. USA 85:5879-5883; and Ward et al., 1989, Nature 

25 341:544-546) can be adapted to produce single chain 
antibodies against NHP gene products. Single chain 
antibodies are formed by linking the heavy and light chain 
fragments of the Fv region via an amino acid bridge, 
resulting in a single chain polypeptide. 

30 Antibody fragments which recognize specific epitopes 

may be generated by known techniques. For example, such 
fragments include, but are not limited to: the F(ab')2 
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fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(ab*)2 
fragments. Alternatively, Fab expression libraries may be 
5 constructed (Huse et al., 1989/ Science, 245:1275-1281) to 
allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 

Antibodies to a NHP can, in turn, be utilized to 
generate anti-idiotype antibodies that "mimic" a given NHP, 

10 using techniques well known to those skilled in the art, 

(See, e.g., 6reens£>an & Bona, 1993, FASEB J 7(^5; : 437-444; and 
Nissinoff, 1991, J. Immunol. 147^8; :2429-2438) . For example 
antibodies which bind to a NHP domain and competitively 
inhibit the binding of NHP to its cognate receptor can be 

15 used to generate anti-idiotypes that "mimic" the NHP and, 

therefore, bind and activate or neutralize a receptor. Such 
anti-idiotypic antibodies or Fab fragments of such anti- 
idiotypes can be used in therapeutic regimens involving a NHP 
mediated pathway. 

20 The present invention is not to be limited in scope by 

the specific embodiments described herein, which are intended 
as single illustrations of individual aspects of the 
invention, and fxinctionally equivalent methods and components 
are within the scope of the invention. Indeed, various 

25 modifications of the invention, in addition to those shown 

and described herein will become apparent to those skilled in 
the art from the foregoing description. Such modifications 
are intended to fall within the scope of the appended claims. 
All cited publications, patents, and patent applications are 

30 herein incorporated by reference in their entirety. 
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WHAT IS r !r,aTMT?.n t*;;- 

1. An isolated nucleic acid molecule comprising 
at a nucleotide sequence encoding an amino acid sequence 

5 drawn from the group consisting of SEQ ID NOS: 2, 4, 6, 8, 
10, and 12. 

2 . An isolated nucleic acid molecule comprising 
a nucleotide sequence that: 

10 (a) encodes the amino acid sequence shown in SEQ 

ID NO: 10; and 
(b) hybridizes under stringent conditions to the 
. nucleotide sequence of SEQ ID NO: 9 or the 
complement thereof. 

15 

3. An isolated nucleic acid molecule coirqprising 
a nucleotide sequence that encodes the amino acid sequence 
shown in SEQ ID NO: 10. 

20 4. An isolated nucleic acid molecule comprising 

a nucleotide sequence that encodes the amino acid sequence 
shown in SEQ ID NO: 4. 

5. An isolated nucleic acid molecule comprising 
25 a nucleotide sequence that encodes the amino acid sequence 
shown in SEQ ID NO: 2. 
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SEQUENCE LISTING 
<110> LEXICON GENETICS INCORPORATED 

<120> Novel Human Transporter Proteins and Polynucleotides Encoding the 
Same 

<130> LEX-0186-PCT 

<150> US 60/2X0,045 
<151> 2000-06-07 

<160> 13 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 2040 

<212> DNA 

<213> homo sapiens 

<400> 1 

atgacaggag caaagaggaa aaagaaaagc atgctttgga gcaagatgca taccccccag 60 

tgtgaagaca ttatacagtg gtgtagaagg cgactgccca ttttggattg ggcaccacat 120 

tacaatctga aagaaaactt gcttccagac actgtgtctg ggataatgtt ggcagttcaa 180 

caggtgaccc aaggattggc ctttgctgtt ctctcatctg tgcacccagt gtttggttta 240 

tatgggtctc tgtttcctgc cataatttat gccatatttg gaatgggaca tcatgttgcc 300 

acaggcacct ttgccttgac atccttaata tcagccaacg ccgtggaacg gattgtccct 360 

cagaacatgc agaatctcac cacacagagt aacacaagcg tgctgggctt atccgacttt 420 

gaaatgcaaa ggatccacgt tgctgcagca gtttccttct tgggaggtgt gattcaggtg 480 

gccatgtttg tgctgcaact gggcagtgcc acatttgtgg tcacagagcc tgtgatcagc 540 

gcaatgacaa ctggggctgc cacccatgtg gtgacttcac aagtcaaata tctcttggga 600 

atgaaaatgc catatatatc cggaccactt ggattctttt atatttatgc atatgttttt 660 

gaaaacatca agtctgtgcg actggaagca ttgcttttat ccttgctgag cattgtggtc 720 

cttgttcttg ttaaagagct gaatgaacag tttaaaagga aaattaaagt tgttcttcct 780 

gtagatttag ttttgattat tgctgcatca tttgcttgtt attgcaccaa tatggaaaac 840 

acatatggat tagaagtagt tggtcatatt ccacaaggaa ttccctcacc tagagctccc 900 

ccgatgaaca tcctctctgc ggtgatcact gaagctttcg gagtggcact tgtaggctat 960 

gtggcctcac tggctcttgc tcaaggatct gccaaaaaat tcaaatattc aattgatgac 1020 

aaccaggaat ttttggccca tggcctcagc aatatagttt cttcattttt cttctgcata 1080 

ccaagtgctg ctgccatggg aaggacggct ggcctgtaca gcacaggagc gaagacacag 1140 

gtggcttgtc taatatcttg cattttcgtc cttatagtca tctatgcaat aggacctttg 1200 

ctttactggc tgcccatgtg tgtccttgca agcattattg ttgtgggact gaagggaatg 1260 

ctaatacagt tccgagattt aaaaaaatat tggaatgtgg ataaaatcga ttggggaata 1320 

tgggtcagta catatgtatt tacaatatgc tttgctgcca atgtgggact gctgtttggt 1380 

gttgtttgta ccatagctat agtgatagga cgcttcccaa gagcaatgac tgtaagtata 1440 

aaaaatatga aagwaatgga atttaaagtg aagacagaaa tggacagtga aaccctgcag 1500 

caggtgaaaa ttatctcaat aaacaacccg cttgttttcc tgaatgcaaa aaaattttat 1560 

actgatttaa tgaacatgat ccaaaaggaa aatgcctgta atcagccact tgatgatatc 1620 

agcaagtgtg aacaaaacac attgcttaat tccctatcca atggcaactg caatgaagaa 1680 

gcttcacagt cctgccctaa tgagaagtgt tatttaatcc tggattgcag tggatttacc 1740 

ttttttgact attctggagt ctccatgctt gttgaggttt acatggactg taaaggcagg 1800 

agtgtggatg tattgttagc ccattgtaca gcttccttga taaaagcaat gacgtattat i860 

ggaaacctag actcagagaa accaattttt tttgaatcgg tatctgctgc aataagtcat 1920 

atccattcaa ataagatgga gtctcgctct gtctcccacg ctggagtgtc gcgatctcgg 1980 
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ctcactgcaa gctccgcctc ccgggttcac gcctttctcc tgcctcagcc tctcgagtag 2040 

<210> 2 

<211> 679 

<212> PRT 

<213> homo sapiens 



<400> 2 

Met Thr Gly Ala Lys Arg Lys Lys Lys Ser Met Leu Trp Ser Lys Met 

1 ■ 5 10 15 

His Thr Pro Gin Cys Glu Asp lie lie Gin Trp Cys Arg Arg Arg Leu 

20 25 30 

Pro lie Leu Asp Trp Ala Pro His Tyr Asn Leu Lys Glu Asn Leu Leu 

35 40 45 

Pro Asp Thr Val Ser Gly lie Met Leu Ala Val Gin Gin Val Thr Gin 

50 55 60 

Gly Leu Ala Phe Ala Val Leu Ser Ser Val His Pro Val Phe Gly Leu 
65 70 75 80 

Tyr Gly Ser Leu Phe Pro Ala lie He Tyr Ala He Phe Gly Met Gly 

85 90 95 

His His Val Ala Thr Gly Thr Phe Ala Leu Thr Ser Leu He Ser Ala 

100 105 110 

Asn Ala Val Glu Arg He Val Pro Gin Asn Met Gin Asn Leu Thr Thr 

115 120 125 

Gin Ser Asn Thr Ser Val Leu Gly Leu Ser Asp Phe Glu Met Gin Arg 

130 135 140 

He His Val Ala Ala Ala Val Ser Phe Leu Gly Gly Val He Gin Val 
145 150 155 160 

Ala Met Phe Val Leu Gin Leu Gly Ser Ala Thr Phe Val Val Thr Glu 

165 170 175 

Pro Val He Ser Ala Met Thr Thr Gly Ala Ala Thr His Val Val Thr 

180 185 190 

Ser Gin Val Lys Tyr Leu Leu Gly Met Lys Met Pro Tyr He Ser Gly 

195 200 205 

Pro Leu Gly Phe Phe Tyr He Tyr Ala Tyr Val Phe Glu Asn He Lys 

210 215 220 

Ser Val 2^g Leu Glu Ala Leu Leu Leu Ser Leu Leu Ser He Val Val 
225 230 235 240 

Leu Val Leu Val Lys Glu Leu Asn Glu Gin Phe Lys Arg Lys He Lys 

245 250 255 

Val Val Leu Pro Val Asp Leu Val Leu He He Ala Ala Ser Phe Ala 

260 265 270 

Cys Tyr Cys Thr Asn Met Glu Asn Thr Tyr Gly Leu Glu Val Val Gly 

275 280 285 

His He Pro Gin Gly He Pro Ser Pro Arg Ala Pro Pro Met Asn He 

290 295 300 

Leu Ser Ala Val He Thr Glu Ala Phe Gly Val Ala Leu Val Gly Tyr 
305 310 315 320 

Val Ala Ser Leu Ala Leu Ala Gin Gly Ser Ala Lys Lys Phe Lys Tyr 

325 330 335 

Ser He Asp Asp Asn Gin Glu Phe Leu Ala His Gly Leu Ser Asn He 

340 345 350 

Val Ser Ser Phe Phe Phe Cys He Pro Ser Ala Ala Ala Met Gly Arg 

355 360 365 

Thr Ala Gly Leu Tyr Ser Thr Gly Ala Lys Thr Gin Val Ala Cys Leu 
370 375 380 
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He Ser Cys He Phe Val Leu He Val He Tyr Ala He Gly Pro Leu 
385 ^ 390 395 400 

Leu Tyr Trp Leu Pro Met Cys Val Leu Ala Ser He He Val Val Gly 

405 410 415 

Leu Lys Gly Met Leu He Gin Phe Arg Asp Leu Lys Lys Tyr Trp Asn 

420 425 430 

Val Asp Lys He Asp Trp Gly He Trp Val Ser Thr Tyr Val Phe Thr 

435 440 445 

He Cys Phe, Ala Ala Asn Val Gly Leu Leu Phe Gly Val Val Cys Thr 

450 455 460 

He Ala He Val He Gly Arg Phe Pro Arg Ala Met Thr Val Ser He 
465 470 475 480 

Lys Asn Met Lys Glu Met Glu Phe Lys Val Lys Thr Glu Met Asp Ser 

485 490 495 

Glu Thr Leu Gin Gin Val Lys He He Ser He Asn Asn Pro Leu Val 

500 505 510 

Phe Leu Asn Ala Lys Lys Phe Tyr Thr Asp Leu Met Asn Met He Gin 

515 520 525 

Lys Glu Asn Ala Cys Asn Gin Pro Leu Asp Asp He Ser Lys Cys Glu 

530 535 540 

Gin Asn Thr Leu Leu Asn Ser Leu Ser Asn Gly Asn Cys Asn Glu Glu 
545 550 555 560 

Ala Ser Gin Ser Cys Pro Asn Glu Lys Cys Tyr Leu He Leu Asp Cys 

565 570 575 

Ser Gly Phe Thr Phe Phe Asp oyr Ser Gly Val Ser Met Leu Val Glu 

580 585 590 

Val Tyr Met Asp Cys Lys Gly Arg Ser Val Asp Val Leu Leu Ala His 

595 600 605 

Cys Thr Ala Ser Leu He Lys Ala Met Thr Tyr Tyr Gly Asn Leu Asp 

610 615 620 

Ser Glu Lys Pro He Phe Phe Glu Ser Val Ser Ala Ala He Ser His 
625 630 635 640 

He His Ser Asn Lys Met Glu Ser Arg Ser Val Ser His Ala Gly Val 

645 650 655 

Ser Arg Ser Arg Leu Thr Ala Ser Ser Ala Ser Arg Val His Ala Phe 

660 665 670 

Leu Leu Pro Gin Pro Leu Glu 
675 

<210> 3 

<211> 1866 

<212> DNA 

<213> homo sapiens 



<400> 3 

atgacaggag caaagaggaa aaagaaaagc 
tgtgeiagaca ttatacagtg gtgtagaagg 
tacaatctga aagaaaactt gcttccagac 
caggtgaccc aaggattggc ctttgctgtt 
tatgggtctc tgtttcctgc cataatttat 
acaggcacct ttgccttgac atccttaata 
cagaacatgc agaatctcac cacacagagt 
gaaatgcaaa ggatccacgt tgctgcagca 
gccatgtttg tgctgcaact gggcagtgcc 
gcaatgacaa ctggggctgc cacccatgtg 
atgaaaatgc catatatatc cggaccactt 



atgctttgga gcaagatgca taccccccag 60 

cgactgccca ttttggattg ggcaccacat 120 

actgtgtctg ggataatgtt ggcagttcaa 180 

ctctcatctg tgcacccagt gtttggttta 240 

gccatatttg gaatgggaca tcatgttgcc 300 

tcagccaacg ccgtggeiacg gattgtccct 360 

aacacaagcg tgctgggctt atccgacttt 420 

gtttccttct tgggaggtgt gattcaggtg 480 

acatttgtgg tcacagagcc tgtgatcagc 540 

gtgacttcac aagtcaaata tctcttggga 600 

ggattctttt atatttatgc atatgttttt 660 
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gaaaacatca agtctgtgcg actggaagca ttgcttttat ccttgctgag cattgtggtc 720 

cttgttcttg ttaaagagct gaatgaacag tttaaaagga aaattaaagt tgttcttcct 780 

gtagatttag ttttgattat tgctgcatca tttgcttgtt attgcaccaa tatggaaaac 840 

acatatggat tagaagtagt tggtcatatt ccacaaggaa ttccctcacc tagagctccc 900 

ccgatgaaca tcctctctgc ggtgatcact gaagctttcg gagtggcact tgtaggctat 960 

gtggcctcac tggctcttgc tcaaggatct gccaaaaaat tcaaatattc aattgatgac 1020 

aaccaggaat ttttggccca tggcctcagc aatatagttt cttcattttt cttctgcata 1080 

ccaagtgctg ctgccatggg aaggacggct ggcctgtaca gcacaggagc gaagacacag 1140 

gtggcttgtc taatatcttg cattttcgtc cttatagtca tctatgcaat aggacctttg 1200 

ctttactggc tgcccatgtg tgtccttgca agcattattg ttgtgggact gaagggaatg 1260 

ctaatacagt tccgagattt aaaaaaatat tggaatgtgg ataaaatcga ttggggaacc 1320 

ctgcagcagg tgaaaattat otcaataaac aacccgcttg ttttcctgaa tgcaaaaaaa 1380 

ttttatactg atttaatgaa catgatccaa aaggaaaatg cctgtaatca gccacttgat 1440 

gatatcagca agtgtgaaca aaacacattg cttaattccc tatccaatgg caactgcaat 1500 

gaagaagctt cacagtcctg ccctaatgag aagtgttatt taatcctgga ttgcagtgga 1560 

tttacctttt ttgactattc tggagtctcc atgcttgttg aggtttacat ggactgtaaa 1620 

ggcaggagtg tggatgtatt gttagcccat tgtacagctt* ccttgataaa agcaatgacg 1680 

tattatggaa acctagactc agagaaacca attttttttg aatcggtatc tgctgcaata 1740 

agtcatatcc attcaaataa gatggagtct cgctctgtct cccacgctgg agtgtcgcga 1800 

tctcggctca ctgcaagctc cgcctcccgg gttcacgcct ttctcctgcc tcagcctctc 1860 

gagtag 1866 

<210> 4 

<211> 621 

<212> PRT 

<213> homo sapiens 

<400> 4 

Met Thr Gly Ala Lys Arg Lys Lys Lys Ser Met I#eu Trp Ser Lys Met 

1 5 . 10 15 

His Thr Pro Gin Cys Glu Asp lie He Gin Trp Cys Arg Arg Arg Leu 

20 25 30 

Pro He Leu Asp Trp Ala Pro His Tyr Asn Leu Lys GXu Asn Leu Leu 

35 40 45 

Pro Asp Thr Val Ser Gly He Met Leu Ala Val Gin Gin Val Thr Gin 

50 55 60 

Gly Leu Ala Phe Ala Val Leu Ser Ser Val His Pro Val Phe Gly Leu 
65 70 75 80 

Tyr Gly Ser Leu Phe Pro Ala He He Tyr Ala He Phe Gly Met Gly 

85 90 95 

His His Val Ala Thr Gly Thr Phe Ala Leu Thr Ser Leu He Ser Ala 

100 105 110 

Asn Ala Val Glu Arg He Val Pro Gin Asn Met Gin Asn Leu Thr Thr 

115 120 125 

Gin Ser Asn Thr Ser Val Leu Gly Leu Ser Asp Phe Glu Met Gin Arg 

130 135 140 

He His Val Ala Ala Ala Val Ser Phe Leu Gly Gly Val He Gin Val 
145 150 155 160 

Ala Met Phe Val Leu Gin Leu Gly Ser Ala Thr Phe Val Val Thr Glu 

165 170 175 

Pro Val He Ser Ala Met Thr Thr Gly Ala Ala Thr His Val Val Thr 

180 185 190 

Ser Gin Val Lys Tyr Leu Leu Gly Met Lys Met Pro Tyr He Ser Gly 

195 200 205 

Pro Leu Gly Phe Phe Tyr He Tyr Ala Tyr Val Phe Glu Asn He Lys 
210 215 220 
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Ser Val Arg Leu Glu Ala Leu Leu Leu Sex Leu Leu Ser lie Val Val 
225 230 235 240 

Leu Val Leu Val Lys Glu Leu Asn Glu Gin Phe Lys Arg Lys lie Lys 

245 250 255 

Val Val Leu Pro Val Asp Leu Val Leu lie lie Ala Ala Ser Phe Ala 

260 265 270 

Cys Tyr Cys Thr Asn Met Glu Asn Thr Tyr Gly Leu Glu Val Val Gly 

275 280 285 

His lie Pro Gin Gly He Pro Ser Pro Arg Ala Pro Pro Met Asn He 

290 295 300 

Leu Ser Ala Val He Thr Glu Ala Phe Gly Val Ala Leu Val Gly Tyr 
305 310 315 320 

Val Ala Ser Leu Ala Leu Ala Gin Gly Ser Ala Lys Lys Phe Lys Tyr 

325 330 335 

Ser He Asp Asp Asn Gin Glu Phe Leu Ala His Gly Leu Ser Asn He 

340 345 350 

Val Ser Ser Phe Phe Phe Cys He Pro Ser Ala Rla Ala Met Gly Arg 

355 360 365 

Thr Ala Gly Leu Tyr Ser Thr Gly Ala Lys Thr Gin Val Ala Cys Leu 

370 375 380 

He Ser Cys He Phe Val Leu He Val He Tyr Ala He Gly Pro Leu 
385 390 395 400 

Leu Tyr Trp Leu Pro Met Cys Val Leu Ala Ser He He Val Val Gly 

405 410 415 

Leu Lys Gly Met Leu He Gin Phe Arg Asp Leu Lys Lys Tyr Trp Asn 

420 425 430 

Val Asp Lys He Asp Trp Gly Thr Leu Gin Gin Val Lys He He Ser 

435 440 445 

He Asn Asn Pro Leu Val Phe Leu Asn Ala Lys Lys Phe Tyr Thr Asp 

450 455 460 

Leu Met 2^n Met He Gin Lys Glu Asn Ala Cys Asn Gin Pro Leu Asp 
465 470 475 . 480 

Asp He Ser Lys Cys Glu Gin Asn Thr Leu Leu Asn Ser Leu Ser Asn 

485 490 495 

Gly Asn cys Asn Glu Glu Ala Ser Gin Ser Cys Pro Asn Glu Lys Cys 

500 505 510 

Ty^ Leu He Leu Asp Cys Ser Gly Phe Thr Phe Phe Asp Tyr Ser Gly 

515 520 525 

Val Ser Met Leu Val Glu Val Tyr Met Asp Cys Lys Gly Arg Ser Val 

530 535 540 

Asp Val Leu Leu Ala His Cys Thr Ala Ser Leu He Lys Ala Met Thr 
545 550 555 560 

Tyr Tyr Gly Asn Leu Asp Ser Glu Lys Pro He Phe Phe Glu Ser Val 

565 570 575 

Ser Ala Ala He Ser His He His Ser Asn Lys Met Glu Ser Arg Ser 

580 585 590 

Val Ser His Ala Gly Val Ser Arg Ser Arg Leu Thr Ala Ser Ser Ala 

595 .600 605 

Ser Arg Val His Ala Phe Leu Leu Pro Gin Pro Leu Glu 
610 615 620 



<210> 5 

<211> 1992 

<212> DNA 

<213> homo sapiens 
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<400> 5 

atgacaggag caaagaggaa aaagaaaagc atgctttgga gcaagatgca taccccccag 60 

tgtgaagaca ttatacagtg gtgtagaagg cgactgccca ttttggattg ggcaccacat 120 

tacaatctga aagaaaactt gcttccagac actgtgtctg ggataatgtt ggcagttcaa 180 

caggtgaccc aaggattggc ctttgctgtt ctctcatctg tgcacccagt gtttggttta 240 

tatgggtctc tgtttcctgc cataatttat gccatatttg gaatgggaca tcatgttgcc 300 

acaggcacct ttgccttgac atccttaata tcagccaacg ccgtggaacg gattgtccct 360 

cagaacatgc agaatctcac cacacagagt aacacaagcg tgctgggctt atccgacttt 420 

gaaatgcaaa ggatccacgt tgctgcagca gtttccttct tgggaggtgt gattcaggtg 480 

gccatgtttg tgctgcaact gggcagtgcc acatttgtgg tcacagagcc tgtgatcagc 540 

gcaatgacaa ctggggctgc cacccatgtg gtgacttcac aagtcaaata tctcttggga 600 

atgaaaatgc catatatatc cggaccactt ggattctttt atatttatgc atatgttttt 660 

gaaaacatca agtctgtgcg actggaagca ttgcttttat ccttgctgag cattgtggtc 720 

cttgttcttg ttaaagagct gaatgaacag tttaaaagga aaattaaagt tgttcttcct 780 

gtagatttag ttttgattat tgctgcatca tttgcttgtt attgcaccaa tatggaaaac 840 

acatatggat kagaagtagt tggtcatatt ccacaaggaa ttccctcacc tagagctccc 900 

ccgatgaaca tcctctctgc ggtgatcact gaagctttcg gagtggcact tgtaggctat 960 

gtggcctcac tggctcttgc tcaaggatct gccaaaaaat tcaaatattc aattgatgac 1020 

aaccaggaat ttttggccca tggcctcagc aatatagttt cttcattttt cttctgcata 1080 

ccaagtgctg ctgccatggg aaggacggct ggcctgtaca gcacaggagc gaagacacag 1140 

gtggcttgtc taatatcttg cattttcgtc cttatagtca tctatgcaat aggacctttg 1200 

ctttactggc tgcccatgtg tgtccttgca agcattattg ttgtgggact gaagggaatg 1260 

ctaatacagt tccgagattt aaaaaaatat tggaatgtgg ataaaatcga ttggggaata 1320 

tgggtcagta catatgtatt tacaatatgc tttgctgcca atgtgggact gctgtttggt 1380 

gttgtttgta ccatagctat agtgatagga cgcttcccaa gagcaatgac tgteiagtata 1440 

aaaaatatga aagwaatgga atttaaagtg aagacagaaa tggacagtga aaccctgcag 1500 

caggtgaaaa ttatctcaat aaacaacccg cttgttttcc tgaatgcaaa aaaattttat 1560 

actgatttaa tgaacatgat ccaaaaggaa aatgcctgta atcagccact tgatgatatc 1620 

agcaagtgtg aacaaaacac attgcttaat tccctatcca atggcaactg caatgaagaa 1680 

gcttcacagt cctgccctaa tgagaagtgt tatttaatcc tggattgcag tggatttacc 1740 

ttttttgact attctggagt ctccatgctt gttgaggttt acatggactg taaaggcagg 1800 

agtgtggatg tattgttagc ccattgtaca gcttccttga taaaagcaat gacgtattat 1860 

ggaaacctag actcagagaa accaattttt tttgaatcgg tatctgctgc aataagtcat 1920 

atccattcaa ataaggccag ttataaactg ttgtttgata acttggatct tccaacaatg 1980 

ccaccgctct ga 1992- 

<210> 6 

<211> 663 

<212> PRT 

<213> homo sapiens 

<400> 6 

Met Thr Gly Ala Lys Arg Lys Lys Lys Ser Met Leu Trp Ser Lys Met 

15 10 15 

His Thr Pro Gin Cys Glu Asp He He Gin Trp Cys Arg Arg Arg Leu 

20 25 30 

Pro He Leu Asp Trp Ala Pro His Tyr Asn Leu Lys Glu Asn Leu Leu 

35 40 45 

Pro Asp Thr Val Ser Gly He Met Leu Ala Val Gin Gin Val Thr Gin 

50 55 60 

Gly Leu Ala Phe Ala Val Leu Ser Ser Val His Pro Val Phe Gly Leu 
65 70 75 80 

Tyr Gly Ser Leu Phe Pro Ala He He Tyr Ala He Phe Gly Met Gly 

85 90 95 

His His Val Ala Thr Gly Thr Phe Ala Leu Thr Ser Leu He Ser Ala 
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Asn Ala Val Glu Arg He Val Pro Gin Asn Met Gin Asn Leu Thr Thr 

115 120 125 

Gin Ser Asn Thr Ser Val Leu Gly Leu Ser Asp Phe Glu Met Gin Arg 

130 135 140 

He His Val Ala Ala Ala Val Ser Phe Leu Gly Gly Val He Gin Val 
145 150 155 160 

Ala Met Phe Val Leu Gin Leu Gly Ser Ala Thr Phe Val Val Thr Glu 

165 170 175 

Pro Val He Ser Ala Met Thr Thr Gly Ala Ala Thr His Val Val Thr 

180 185 190 

Ser Gin Val Lys Tyr Leu Leu Gly Met Lys Met Pro Tyr He Ser Gly 

195 200 205 

Pro Leu Gly Phe Phe Tyr He Tyr Ala Tyr Val Phe Glu Asn He Lys 

210 215 220 

Ser Val Arg Leu Glu Ala Leu Leu Leu Ser Leu Leu Ser He Val Val 
225 230 235 240 

Leu Val Leu Val Lys Glu Leu Asn Glu Gin Phe Lys Arg Lys He Lys 

245 250 255 

Val Val Leu Pro Val Asp Leu Val Leu He He Ala Ala Ser Phe Ala 

260 265 270 

Cys Tyr Cys Thr Asn Met Glu Asn Thr. Tyr Gly Leu Glu Val Val Gly 

275 280 285 

His He Pro Gin Gly He Pro Ser Pro Arg Ala Pro Pro Met Asn He 

290 295 300 

Leu Ser Ala Val He Thr Glu Ala Phe Gly Val Ala Leu Val Gly Tyr 
305 310 315 320 

Val Ala Ser Leu Ala Leu Ala Gin Gly Ser Ala Lys Lys Phe Lys Tyr 

325 330 335 

Ser He Asp Asp Asn Gin Glu Phe Leu Ala His Gly Leu Ser Asn He 

340 345 350 

Val Ser Ser Phe Phe Phe Cys He Pro Ser Ala Ala Ala Met Gly Arg 

355 360 365 

Thr Ala Gly Leu Tyr Ser Thr Gly Ala Lys Thr Gin Val Ala Cys Leu 

370 375 380 

He Ser Cys He Phe Val Leu He Val He Tyr Ala He Gly Pro Leu 
385 390 395 400 

Leu Tyr Trp Leu Pro Met Cys Val Leu Ala Ser He He Val Val Gly 

405 410 415 

Leu Lys Gly Met Leu He Gin Phe Arg Asp Leu Lys Lys Tyr Trp Asn 

420 425 430 

Val Asp Lys He Asp Trp Gly He Trp Val Ser Thr Tyr Val Phe Thr 

435 440 445 

He Cys Phe Ala Ala Asn Val Gly Leu Leu Phe Gly Val Val Cys Thr 

450 455 460 

He Ala He Val He Gly Arg Phe Pro Arg Ala Met Thr Val Ser He 
465 470 475 480 

Lys Asn Met Lys Glu Met Glu Phe Lys Val Lys Thr Glu Met Asp Ser 

485 490 495 

Glu Thr Leu Gin Gin Val Lys He He Ser He Asn Asn Pro Leu Val 

500 505 510 

Phe Leu Asn Ala Lys Lys Phe Tyr Thr Asp Leu Met Asn Met He Gin 

515 520 525 

Lys Glu Asn Ala Cys Asn Gin Pro Leu Asp Asp He Ser Lys Cys Glu 

530 535 540 

Gin Asn Thr Leu Leu Asn Ser Leu Ser Asn Gly Asn Cys Asn Glu Glu 
545 550 555 560 
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Gin 


Ser 


Cys 


Pro 


Asn 


Glu 


Lys 


Cys 


Tyr Leu He Leu 


Asp 


Cys 








565 










570 




575 




Ser Gly 


Phe 


Thr 


Phe 


Pne 


Asp 


Tyr 


Ser 


Gly 


Val Ser Met Leu 


Val 


Glu 






580 










585 




590 






Val Tyx 


Met 


Asp 


Qys 


Lys 


Gly 


Arg 


Ser 


Val 


Asp Val Leu Leu 


Ala 


His 




595 










600 






605 






Cys Thr 


Ala 


Ser 


Leu 


lie 


Lys 


Ala 


Met 


Thr 


Tyr Tyr Gly Asn 


Leu 


Asp 


610 










615 








620 






Ser Glu 


Lys 


Pro 


He 


Phe 


Phe 


Glu 


Ser 


Val 


Ser Ala Ala He 


Ser 


His 


625 








630 










635 




640 


lie His 


Ser 


Asn 


Lys 


Ala 


Ser 


Tyr 


Lys 


Leu 


Leu Phe Asp Asn 


Leu 


Asp 








645 










650 




655 




Leu Pro 


Thr 


Met 


Pro 


Pro 


Leu 















660 



<210> 7 

<211> 1818 

<212> DNA 

<213> homo sapiens 

<400> 7 

atgacaggag caaagaggaa aaagaaaagc atgctttgga gcaagatgca taccccccag 60 

tgtgaagaca ttatacagtg gtgtagaagg cgactgccca ttttggattg ggcaccacat 120 

tacaatctga aagaaaactt gcttccagac actgtgtctg ggataatgtt ggcagttcaa 180 

caggtgaccc aaggattggc ctttgctgtt ctctcatctg tgcacccagt gtttggttta 240 

tatgggtctc tgtttcctgc cataatttat gccatatttg gaatgggaca tcatgttgcc 300 

acaggcacct ttgccttgac atccttaata tcagccaacg ccgtggaacg gattgtccct 360 

cagaacatgc agaatctcac cacacagagt aacacaagcg tgctgggctt atccgacttt 420 

gaaatgcaaa ggatccacgt tgctgcagca gtttccttct tgggaggtgt gattcaggtg 480 

gccatgtttg tgctgcaact gggcagtgcc acatttgtgg tcacagagcc tgtgatcagc 540 

gcaatgacaa ctggggctgc cacccatgtg gtgacttcac aagtcaaata tctcttggga 600 

atgaaaatgc catatatatc cggaccactt ggattctttt atatttatgc atatgttttt 660 

gaaaacatca agtctgtgcg actggaagca ttgcttttat ccttgctgag cattgtggtc 720 

cttgttcttg ttaaagagct gaatgaacag tttaaaagga aaattaaagt tgttcttcct 780 

gtagatttag ttttgattat tgctgcatca tttgcttgtt attgcaccaa tatggaaaac 840 

acatatggat tagaagtagt tggtcatatt ccacaaggaa ttccctcacc tagagctccc 900 

ccgatgaaca tcctctctgc ggtgatcact gaagctttcg gagtggcact tgtaggctat 960 

gtggcctcac tggctcttgc tcaaggatct gccaaaaaat tcaaatattc aattgatgac 1020 

aaccaggaat ttttggccca tggcctcagc aatatagttt cttcattttt cttctgcata 1080 

ccaagtgctg ctgccatggg aaggacggct ggcctgtaca gcacaggagc gaagacacag 1140 

gtggcttgtc taatatcttg cattttcgtc cttatagtca tctatgcaat aggacctttg 1200 

ctttactggc tgcccatgtg tgtccttgca agcattattg ttgtgggact gaagggaatg 1260 

ctaatacagt tccgagattt aaaaaaatat tggaatgtgg ataaaatcga ttggggaacc 1320 

ctgcagcagg tgaaaattat ctcaataaac aacccgcttg ttttcctgaa tgcaaaaaaa 1380 

ttttatactg atttaatgaa catgatccaa aaggaaaatg cctgtaatca gccacttgat 1440 

gatatcagca agtgtgaaca aaacacattg cttaattccc tatccaatgg caactgcaat 1500 

gaagaagctt cacagtcctg ccctaatgag aagtgttatt taatcctgga ttgcagtgga 1560 

tttacctttt ttgactattc tggagtctcc atgcttgttg aggtttacat ggactgtaaa 1620 

ggcaggagtg tggatgtatt gttagcccat tgtacagctt ccttgataaa agcaatgacg 1680 

tattatggaa acctagactc agagaaacca attttttttg aatcggtatc tgctgcaata 1740 

agtcatatcc attcaaataa ggccagttat aaactgttgt ttgataactt ggatcttcca 1800 

acaatgccac cgctctga * 1818 

<210> 8 
<211> 605 
<212> PRT 
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<213> homo sapiens 



<400> 8 





Thr 


Gly 


Ala 


Lys Arg 


Lys 


Lys 


Lys 


Ser 


Met Leu Trp Ser Lys Met 


J. 








5 










1 A 

lU 


15 


His 


Thr 


Pro 


Gin 


Cys 


Glu 


Asp 


He 


He 


Gin 


Trp Cys Arg Arg Arg Leu 


















25 




30 


Pro 


Tl A 

xj.e 


Leu 


Asp 


Trp Ala 


Fro 


IT -I « 

His 


Tyr 


Asn 


Leu Lys Glu Asn Leu Leu 
















40 






45 


Pro 


Asp 


Thr 


Val 


Ser 


triy 


He 


Met 


Leu 


Ala 


Val Gin Gin Val Thr Gin 




DO 










55 








60 


Gly 


Leu 


Ala 


Phe 


Ala 


Val 


Leu 


Ser 


Ser 


Val 


His Pro Val Phe Gly Leu 


65 










70 










75 80 


Tyr 


Gly 


Ser 


Leu 


pne 


Pro 


Ala 


He 


He 


Tyr 


Ala He Phe Gly Met Gly 










85 










90 


95 


His 


His 


Val 


Ala 


Thr 


Gly 


Thr 


Phe 


Ala 


Leu 


Thr Ser Leu He Ser Ala 








100 










105 




110 


Asn 


Ala 


Val 


Glu 


Arg 


He 


Val 


Pro 


Gin 


Asn 


Met Gin Asn Leu Thr Thr 






lie 

115 










120 






125 


Gin 


Ser 


Asn 


Thr 


Ser 


val 


Leu 


Gly 


Leu 


Ser 


Asp Phe Glu Met Gin Arg 




130 










135 








140 


lie 


His 


Val 


Ala 


Ala 


Ala 


Val 


Ser 


Phe 


Leu 


Gly Gly Val He Gin Val 


145 










150 










155 160 


Ala 


Met 


Phe 


Val 


Leu 


Gin 


Leu 


Gly 


Ser 


Ala 


Thr Phe Val Val Thr Glu 










165 










170 


175 


Pro 


Val 


He 


Ser 


Ala 


Met 


Thr 


Thr 


Gly 


Ala 


Ala Thr His Val Val Thr 








180 










185 




190 


Ser 


Gin 


Val 


Lys 




Leu 


Leu 


Gly 


Met 


Lys 


Met Pro Tyr He Ser Gly 






195 










200 






205 


Pro 


Leu 


Gly 


Phe 


Phe 


Tyr 


He 


Tyr 


Ala 


Tyr 


Val Phe Glu Asn He Lys 




210 










215 








220 


Ser 


Val 


Arg 


Leu 


Glu 


Ala 


Leu 


Leu 


Leu 


Ser 


Leu Leu Ser He Val Val 


225 










230 










235 240 


Leu 


Val 


Leu 


Val 


Lys 


Glu 


Leu 


Asn 


Glu 


Gin 


Phe Lys Arg Lys He Lys 










245 










250 


255 


Val 


Val 


Leu 


Pro 


Val 


Asp 


Leu 


Val 


Leu 


He 


He Ala Ala Ser Phe Ala 








260 










265 




270 


Cys 


Tyr 


Cys 


Thr 


Asn 


Met 


Glu 


Asn 


Thr 


Tyr 


Gly Leu Glu Val Val Gly 






275 










280 






285 


HXS 


He 


Pro 


Gin 


uiy 


Xl A 

lie 


Pro 


Ser 


Pro 


Arg 


Ala Pro Pro Met Asn He 




o Q n 










295 








300 


Leu 


Ser 


Ala 


Val 


He 


Thr 


Glu 


Ala 


Phe 


Gly 


Val Ala Leu Val Gly Tyr 












310 










315 320 


va± 


Axa 


Ser 


Lieu 


Ala 


Leu 


Xt M 

Ala 


Gin 


Gly 


Ser 


Ala Lys Lys Phe Lys Tyr 










325 










*i "a 

330 


335 


Ser 


x±e 


Asp 


ASp 


Asn 


Gin 


Glu 


Phe 


Leu 


Ala 


His Gly Leu Ser Asn He 






















'itzn 


Val 


Ser 


Ser 


Phe 


Phe 


Phe 


Cys 


He 


Pro 


Ser 


Ala Ala Ala Met Gly Arg 






355 










360 






365 


Thr 


Ala 


Gly 


Leu 


Tyr 


Ser 


Thr 


Gly 


Ala 


Lys 


Thr Gin Val Ala Cys Leu 




370 










375 








380 


He 


Ser 


Cys 


He 


Phe 


Val 


Leu 


He 


Val 


He 


Tyr Ala He Gly Pro Leu 


385 










390 










395 400 


Leu 


Tyr 


Trp 


Leu 


Pro 


Met 


Cys 


Val 


Leu 


Ala 


Ser He He Val Val Gly 










405 










410 


415 


Leu 


Lys 


Gly 


Met 


Leu 


He 


Gin 


Phe 


Arg 


Asp 


Leu Lys Lys Tyr Trp Asn 
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420 




425 










430 






Val 


Asp 


Lys 
435 


He 


Asp Trp Gly Thr 
440 


Leu 


Gin 


Gin 


Val 


Lys 
445 


He 


He 


Ser 


lie 


Asn 
450 


Asn 


Pro 


Leu Val Phe Leu 
455 


Asn 


Ala 


Lys 


Lys 
460 


Phe 


Tyr 


Thr 


Asp 


Leu 


Met 


Asn 


Met 


He Gin Lys Glu 


Asn 


Ala 


Cys 


Asn 


Gin 


Pro 


Leu 


Asp 


465 








470 






475 










480 


Asp 


He 


Ser 


Lys 


Cys Glu Gin Asn 
485 


Thr 


Leu 
490 


Leu 


Asn 


Ser 


Leu 


Ser 

495 


Asn 


Gly 


Asn 


Cys 


Asn 
500 


Glu Glu Ala Ser 


Gin 
505 


Ser 


Cys 


Pro 


Asn 


Glu 
510 


Lys 


Cys 


Tyr 


Leu 


He 
515 


Leu 


Asp Cys Ser Gly 
520 


Phe 


Thr 


Phe 


Phe 


Asp 
525 


Tyr 


Ser 


Gly 


Val 


Ser 
530 


Met 


Leu 


Val Glu Val Tyr 
535 


Met 


Asp 


Cys 


Lys 
540 


Gly 


Arg 


Ser 


Val 


Asp 


Val 


Leu 


Leu 


Ala His Cys Thr 


Ala 


Ser 


Leu 


He 


Lys 


Ala 


Met 


Thr 


545 








550 






555 










560 


Tyr 


Tyr 


Gly 


Asn 


Leu Asp Ser Glu 
565 


Lys 


Pro 
570 


He 


Phe 


Phe 


Glu 


Ser 
575 


Val 


Ser 


Ala 


Ala 


He 
580 


Ser His He His 


Ser 
585 


Asn 


Lys 


Ala 


Ser 


Tyr 
590 


Lys 


Leu 


Leu 


Phe 


Asp 
595 


Asn 


Leu Asp Leu Pro 
600 


Thr 


Met 


Pro 


Pro 


Leu 
605 









<210> 9 

<211> 1971 

<212> DNA 

<213> homo sapiens 

<400> 9 

atgacaggag caaagaggaa aaagaaaagc atgctttgga gcaagatgca taccccccag 60 

tgtgaagaca ttatacagtg gtgtagaagg cgactgccca ttttggattg ggcaccacat 120 

tacaatctga aagaaaactt gcttccagac actgtgtctg ggataatgtt ggcagttcaa 180 

caggtgaccc aaggattggc ctttgctgtt ctctcatctg tgcacccagt gtttggttta 240 

tatgggtctc tgtttcctgc cataatttat gccatatttg gaatgggaca tcatgttgcc 300 

acaggcacct ttgccttgac atccttaata tcagccaacg ccgtggaacg gattgtccct 360 

cagaacatgc agaatctcac cacacagagt aacacaagcg tgctgggctt atccgacttt 420 

gaaatgcaaa ggatccacgt tgctgcagca gtttccttct tgggaggtgt gattcaggtg 480 

gccatgtttg tgctgcaact gggcagtgcc acatttgtgg tcacagagcc tgtgatcagc 540 

gcaatgacaa ctggggctgc cacccatgtg gtgacttcac aagtcaaata tctcttggga 600 

atgaaaatgc catatatatc cggaccactt ggattctttt atatttatgc atatgttttt 660 

gaaaacatca agtctgtgcg actggaagca ttgcttttat ccttgctgag cattgtggtc 720 

cttgttcttg ttaaagagct gaatgaacag tttaaaagga aaattaaagt tgttcttcct 780 

gtagatttag ttttgattat tgctgcatca tttgcttgtt attgcaccaa tatggaaaac 840 

acatatggat tagaagtagt tggtcatatt ccacaaggaa ttccctcacc tagagctccc 900 

ccgatgaaca tcctctctgc ggtgatcact gaagctttcg gagtggcact tgtaggctat 960 

gtggcctcac tggctcttgc tcaaggatct gccaaaaaat tcaaatattc aattgatgac 1020 

aaccaggaat ttttggccca tggcctcagc aatatagttt cttcattttt cttctgcata 1080 

ccaagtgctg ctgccatggg aaggacggct ggcctgtaca gcacaggagc gaagacacag 1140 

gtggcttgtc taatatcttg cattttcgtc cttatagtca tctatgcaat aggacctttg 1200 

ctttactggc tgcccatgtg tgtccttgca agcattattg ttgtgggact gaagggaatg 1260 

ctaatacagt tccgagattt aaaaaaatat tggaatgtgg ataaaatcga ttggggaata 1320 

tgggtcagta catatgtatt tacaatatgc tttgctgcca atgtgggact gctgtttggt 1380 

gttgtttgta ccatagctat agtgatagga cgcttcccaa gagcaatgac tgtaagtata 1440 

aaaaatatga aagwaatgga atttaaagtg aagacagaaa tggacagtga aaccctgcag 1500 

caggtgaaaa ttatctcaat aaacaacccg cttgttttcc tgaatgcaaa aaaattttat 1560 
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actgatttaa tgaacatgat ccaaaaggaa aatgcctgta atcagccact tgatgatatc 1620 

agcaagtgtg aacaaaacac attgcttaat tccctatcca atggcaactg caatgaagaa 1680 

gcttcacagt cctgccctaa tgagaagtgt tatttaatcc tggattgcag tggatttacc 1740 

ttttttgact attctggagt ctccatgctt gttgaggttt acatggactg taaaggcagg 1800 

agtgtggatg tattgttagc ccattgtaca gcttccttga taaaagcaat gacgtattat 1860 

ggaaacctag actcagagaa accaattttt tttgaatcgg tatctgctgc aataagtcat 1920 

atccattccia ataagaattt gagcaaactc agtgaccaca gtgaagtctg a 1971 

<210> 10 

<211> 656 

<212> PRT 

<213> hcxmo sapiens 

<400> 10 

Met Thr Gly Ala Lys Arg Lys Lys Lys Ser Met Leu Trp Ser Lys Met 

15 10 15 

His Thr Pro Gin Cys Glu Asp lie lie Gin Trp Cys Arg Arg Arg Leu 

20 25 30 

Pro lie Leu Asp Trp Ala Pro His Tyr Asn Leu Lys Glu Asn Leu Leu 

35 40 45 

Pro Asp Thr Val Ser Gly He Met Leu Ala Val Gin Gin Val Thr Gin 

50 55 60 

Gly Leu Ala Phe Ala Val Leu Ser Ser Val His Pro Val Phe Gly Leu 
65 70 75 80 

Tyr Gly Ser Leu Phe Pro Ala He He Tyr Ala He Phe Gly Met Gly 

85 90 95 

His His Val Ala Thr Gly Thr Phe Ala Leu Thr Ser Leu lie Ser Ala 

100 105 110 

Asn Ala Val Glu Arg He Val Pro Gin Asn Met .Gin Asn. Leu Thr Thr 

115 120 125 

Gin Ser Asn Thr Ser Val Leu Gly Leu Ser Asp Phe Glu Met Gin Arg 

130 135 140 

He His Val Ala Ala Ala Val Ser Phe Leu Gly Gly Val He Gin Val 
145 150 155 160 

Ala Met Phe Val Leu Gin Leu Gly Ser Ala Thr Phe Val Val Thr Glu 

165 170 175 

Pro Val He Ser Ala Met Thr Thr Gly Ala Ala Thr His Val Val Thr 

180 185 190 

Ser Gin Val Lys Tyr Leu Leu Gly Met Lys Met Pro Tyr He Ser Gly 

195 200 205 

Pro Leu Gly Phe Phe Tyr He Tyr Ala Tyr Val Phe Glu Asn He Lys 

210 215 220 

Ser Val Arg Leu Glu Ala Leu Leu Leu Ser Leu Leu Ser He Val Val 
225 230 235 240 

Leu Val Leu Val Lys Glu LeU Asn Glu Gin Phe Lys Arg Lys He Lys 

245 250 255 

Val Val Leu Pro Val Asp Leu Val Leu He He Ala Ala Ser Phe Ala 

260 265 270 

Cys Tyr Cys Thr Asn Met Glu Asn Thr Tyr Gly Leu Glu Val Val Gly' 

275 280 285 

His He Pro Gin Gly He Pro Ser Pro Arg Ala Pro Pro Met Asn He 

290 295 300 

Leu Ser Ala Val lie Thr Glu Ala Phe Gly Val Ala Leu Val Gly Tyr 
305 310 315 320 

Val Ala Ser Leu Ala Leu Ala Gin Gly Ser Ala Lys Lys Phe Lys Tyr 
325 330 335 
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Ser lie Asp Asp Ash Gin Glu Phe Leu Ala His Gly Leu Ser Asn Xle 

340 345 350 

Val Ser Ser Phe Phe Phe Cys lie Pro Ser Ala Ala Ala Met Gly Arg 

355 360 365 

Thr Ala Gly Leu Tyr Ser Thr Gly Ala Lys Thr Gin Val Ala Cys Leu 

370 375 380 

lie Ser Cys He Phe Val Leu He Val He Tyr Ala He Gly Pro Leu 
385 390 395 400 

Leu Tyr Trp Leu Pro Met Cys Val Leu Ala Ser He He Val Val Gly 

405 410 415 

Leu Lys Gly Met Leu He Gin Phe Arg Asp Leu Lys Lys Tyr Trp Asn 

420 425 430 

Val Asp Lys He Asp Trp Gly He Trp Val Ser Thr Tyr Val Phe Thr 

435 440 . 445 

He Cys Phe Ala Ala Asn Val Gly Leu Leu Phe Gly Val Val Cys Thr 

450 455 460 

He Ala He Val He Gly Arg Phe Pro Arg Ala Met Thr Val Ser He 
465 470 475 480 

Lys Asn Met Lys Glu Met Glu Phe Lys Val Lys Thr Glu Met Asp Ser 

485 490 495 

Glu Thr Leu Gin Gin Val Lys He He Ser He Asn Asn Pro Leu Val 

500 505 510 

Phe Leu Asn Ala Lys Lys Phe Tyr Thr Asp Leu Met Asn Met He Gin 

515 520 525 

Lys Glu Asn Ala Cys Asn Gin Pro Leu Asp Asp He Ser Lys Cys Glu 

530 535 540 

Gin Asn Thr Leu Leu Asn Ser Leu Ser Asn Gly Asn Cys Asn Glu Glu 
545 550 555 560 

Ala Ser Gin Ser Cys Pro Asn Glu Lys Cys Tyr Leu He Leu Asp Cys 

565 570 575 

Ser Gly Phe Thr Phe Phe Asp Tyr Ser Gly Val Ser Met Leu Val Glu 

580 585 .590 

Val Tyr Met Asp Cys Lys Gly Arg Ser Val Asp Val Leu Leu Ala His 

595 600 605 

Cys Thr Ala Ser Leu He Lys Ala Met Thr Tyr Tyr Gly Asn Leu Asp 

610 615 620 

Ser Glu Lys Pro He Phe Phe Glu Ser Val Ser Ala Ala He Ser His 
625 630 635 640 

He His Ser Asn Lys Asn Leu Ser Lys Leu Ser Asp His Ser Glu Val 
645 650 655 

<210> 11 

<211> 1797 

<212> DMA 

<213> homo sapiens 

<400> 11 

atgacaggag caeLagaggeia aaagaaaagc atgctttgga gcaagatgca taccccccag 60 

tgtgaagaca ttatacagtg gtgtagaagg cgactgccca ttttggattg ggcaccacat 120 

tacaatctga aagaaaactt gcttccagac actgtgtctg ggataatgtt ggcagttcaa 180 

caggtgaccc aaggattggc ctttgctgtt ctctcatctg tgcacccagt gtttggttta 240 

tatgggtctc tgtttcctgc cataatttat gccatatttg gaatgggaca tcatgttgcc 300 

acaggcacct ttgccttgac atccttaata tcagccaacg ccgtggaacg gattgtccct 360 

cagaacatgc agaatctcac cacacagagt aacacaagcg tgctgggctt atccgacttt 420 

gaaatgcaaa ggatccacgt tgctgcagca gtttccttct tgggaggtgt gattcaggtg 480 

gccatgtttg tgctgcaact gggcagtgcc acatttgtgg tcacagagcc tgtgatcagc 540 
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gcaatgacaa ctggggctgc cacccatgtg gtgacttcac aagtcaaata tctcttggga 600 

atgaaaatgc catatatatc cggaccactt ggattctttt atatttatgc atatgttttt 660 

gaaaacatca agtctgtgcg actggaagca ttgcttttat ccttgctgag cattgtggtc 720 

cttgttcttg ttaaagagct gaatgaacag tttaaaagga aaattaaagt tgttcttcct • 780 

gtagatttag ttttgattat tgctgcatca tttgcttgtt attgcaccaa tatggaaaac 840 

acatatggat tagaagtagt tggtcatatt ccacaaggaa ttccctcacc tagagctccc 900 

ccgatgaaca tcctctctgc ggtgatcact gaagctttcg gagtggcact tgtaggctat 960 

gtggcctcac tggctcttgc tcaaggatct gccaaaaaat tcaaatattc aattgatgac 1020 

aaccaggaat ttttggccca tggcctcagc aatatagttt cttcattttt cttctgcata 1080 

ccaagtgctg ctgccatggg aaggacggct ggcctgtaca gcacaggagc gaagacacag 1140 

gtggcttgtc taatatcttg cattttcgtc cttatagtca tctatgcaat aggacctttg 1200 

ctttactggc tgcccatgtg tgtccttgca agcattattg ttgtgggact gaagggaatg 1260 

ctaatacagt tccgagattt aaaaaaatat tggaatgtgg ataaaatcga ttggggaacc 1320 

ctgcagcagg tgaaaattat ctcaataaac aacccgcttg ttttcctgaa tgcaaaaaaa 1380 

ttttatactg atttaatgaa catgatccaa aaggaaaatg cctgtaatca gccacttgat 1440 

gatatcagca agtgtgaaca aaacacattg cttaattccc tatccaatgg caactgcaat 1500 

gaagaagctt cacagtcctg ccctaatgag aagtgttatt taatcctgga ttgcagtgga 1560 

tttacctttt ttgactattc tggagtctcc atgcttgttg aggtttacat ggactgtaaa 1620 

ggcaggagtg tggatgtatt gttagcccat tgtacagctt ccttgataaa agcaatgacg 1680 

tattatggaa acctagactc agagaaacca attttttttg aatcggtatc tgctgcaata 1740 

agtcatatcc attcaaataa gaatttgagc aaactcagtg accacagtga agtctga 1797 

<210> 12 

<211> 598 

<212> PRT 

<213> homo sapiens 

<400> 12 

Met Thr Gly Ala Lys Arg Lys Lys Lys Ser Met Leu Trp Sex Lys Met 

15 10 15 

His Thr Pro Gin Cys Glu Asp He He Gin Trp Cys Arg Arg Arg Leu 

20 25 30 

Pro He Leu Asp Trp Ala Pro His Tyx Asn Leu Lys Glu Asn Leu Leu 

35 40 45 

Pro Asp Thr Val Ser Gly He Met Leu Ala Val Gin Gin Val Thr Gin 

50 55 60 

Gly Leu Ala Phe Ala Val Leu Ser Ser Val His Pro Val Phe Gly Leu 
65 70 75 80 

Tyr Gly Ser Leu Phe Pro Ala He He Tyr Ala He Phe Gly Met Gly 

85 90 95 

His His Val Ala Thr Gly Thr Phe Ala Leu Thr Ser Leu He Ser Ala 

100 105 110 

Asn Ala Val Glu Arg He Val Pro Gin Asn Met Gin Asn Leu Thr Thr 

115 120 125 

Gin Ser Asn Thr Ser Val lieu Gly Leu Ser Asp Phe Glu Met Gin Arg 

130 135 140 

He His Val Ala Ala Ala Val Ser Phe Leu Gly Gly Val He Gin Val 
145 150 155 160 

Ala Met Phe Val Leu Gin Leu Gly Ser Ala Thr Phe Val Val Thr Glu 

165 170 175 

Pro Val He Ser Ala Met Thr Thr Gly Ala Ala Thr His Val Val Thr 

180 185 190 

Ser Gin Val Lys Tyr Leu Leu Gly Met Lys Met Pro Tyr He Ser Gly 

195 200 205 

Pro Leu Gly Phe Phe Tyr He Tyr Ala Tyr Val Phe Glu Asn He Lys 
210 215 220 
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Ser 


Val 


Arg 


Leu 


Glu Ala Leu Leu Leu Ser Leu 


Leu Ser He Val Val 


225 








230 235 


240 


hevL 


Val 


Leu 


Val 


Lys Glu Leu Asn Glu Gin Phe 


Lys Arg Lys He Lys 










245 250 


255 


Val 


Val 


Leu 


Pro 


Val Asp Leu Val Leu He He 


Ala Ala Ser Phe Ala 








260 


265 


270 


Cys 


Tyx 


Cys 


Thr 


Asn Met Glu Asn Thr Tyr Gly 


Leu Glu Val Val Gly 






275 




280 


285 


His 


He 


Pro 


Gin 


Gly He Pro Ser Pro Arg Ala 


Pro Pro Met Asn He 




290 






295 


300 


Leu 


Ser 


Ala 


Val 


He Thr Glu Ala Phe Gly Val 


Ala Leu Val Gly Tyr 


305 








310 315 


320 


Val 


Ala 


Ser 


Leu 


Ala Leu Ala Gin Gly Ser Ala 


Lys Lys Phe Lys Tyr 










325 330 


335 


Ser 


lie 


Asp 


Asp 


Asn Gin Glu Phe Leu Ala His 


Gly Leu Ser Asn He 








340 


345 


350 


Val 


Ser 


Ser 


Phe 


Phe Phe Cys He Pro Ser Ala 


Ala Ala Met Gly Arg 






355 




360 


365 - 


Thr 


Ala 


Gly 


Leu 


Tyr Ser Thr Gly Ala Lys Thr 


Gin Val Ala Cys Leu 




370 






375 


380 


lie 


Ser 


Cys 


lie 


Phe Val Leu He Val He Tyr 


Ala He Gly Pro Leu 


385 








390 395 


400 


Leu 


Tyr 


Trp 


Leu 


Pro Met Cys Val Leu Ala Ser 


He He Val Val Gly 










405 410 


415 


Leu 


Lys 


Gly 


Met 


Leu He Gin Phe Arg Asp Leu Lys Lys Tyr Trp Asn 








420 


425 


430 


Val 


Asp 


Lys 


He 


Asp Trp Gly Thr Leu Gin Gin Val Lys He He Ser 






435 




440 


445 


lie 


Asn 


Asn 


Pro 


Leu Val Phe Leu Asn Ala Lys Lys Phe Tyr Thr Asp 




450 






455 


460 



Leu Met Asn Met He Gin Lys Glu Asn Ala Cys Asn Gin Pro Leu T^p 
465 470 475 480 

Asp He Ser Lys Cys Glu Gin Asn Thr Leu Leu Asn Ser Leu Ser Asn 

485 490 495 

Gly Asn Cys Asn Glu Glu Ala Ser Gin Ser Cys Pro Asn Glu Lys Cys 

500 505 510 

Tyr Leu He Leu Asp Cys Ser Gly Phe Thr Phe Phe Asp Tyr Ser Gly 

515 520 525 

Val Ser Met Leu Val Glu Val Tyr Met Asp Cys Lys Gly Arg Ser Val 

530 535 540 

Asp Val Leu Leu Ala His Cys Thr Ala Ser Leu He Lys Ala Met Thr 
545 550 555 560 

Tyr Tyr Gly Asn Leu Asp Ser Glu Lys Pro He Phe Phe Glu Ser Val 

565 570 575 

Ser Ala Ala He Ser His He His Ser Asn Lys Asn Leu Ser Lys Leu 

580 585 590 

Ser Asp His Ser Glu Val 
595 

<210> 13 

<211> 2420 

<212> rajA 

<213> homo sapiens 

<400> 13 

ctccttcaga taagaattca agctttgaca ttgtaaacca cagacgaatt ggagcttggc 60 
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attgaaagga ggtgttctgc aatgattttt tttcttgttt agagaagttt acttctacaa 120 

gaagaaatct gaaaaatgac aggagcaaag aggaaaaaga aaagcatgct ttggagcaag 180 

atgcataccc cccagtgtga agacattata cagtggtgta gaaggcgact gcccattttg 240 

gattgggcac cacattacaa tctgaaagaa aacttgcttc cagacactgt gtctgggata 300 

atgttggcag ttcaacaggt gacccaagga ttggcctttg ctgttctctc atctgtgcac 360 

ccagtgtttg gtttatatgg gtctctgttt cctgccataa tttatgccat atttggaatg 420 

ggacatcatg ttgccacagg cacctttgcc ttgacatcct taatatcagc caacgccgtg 480 

gaacggattg tccctcagaa catgcagaat ctcaccacac agagtaacac aagcgtgctg 540 

ggcttatccg actttgaaat gcaaaggatc cacgttgctg cagcagtttc cttcttggga 600 

ggtgtgattc aggtggccat gtttgtgctg caactgggca gtgccacatt tgtggtcaca 660 

gagcctgtga tcagcgcaat gacaactggg gctgccaccc atgtggtgac ttcacaagtc 720 

aaatatctct tgggaatgaa aatgccatat atatccggac cacttggatt ctttjtatatt 780 

tatgcatatg tttttgaaaa catcaagtct gtgcgactgg aagcattgct tttatccttg 840 

ctgagcattg tggtccttgt tcttgttaaa gagctgaatg aacagtttaa aaggaaaatt 900 

aaagttgttc ttcctgtaga tttagttttg attattgctg catcatttgc ttgttattgc 960 

accaatatgg aaaacacata tggattagaa gtagttggtc, atattccaca aggaattccc 1020 

tcacctagag ctcccccgat gaacatcctc tctgcggtga tcactgaagc tttcggagtg 1080 

gcacttgtag gctatgtggc ctcactggct cttgctcaag gatctgccaa aaaattcaaa 1140 

tattcaattg atgacaacca ggaatttttg gcccatggcc tcagcaatat agtttcttca 1200 

tttttcttct gcataccaag tgctgctgcc atgggaagga cggctggcct gtacagcaca 1260 

ggagcgaaga cacaggtggc ttgtctaata tcttgcattt tcgtccttat agtcatctat 1320 

gcaataggac ctttgcttta ctggctgccc atgtgtgtcc ttgcaagcat tattgttgtg 1380 

ggactgaagg gaatgctaat acagttccga gatttaaaaa aatattggaa tgtggataaa 1440 

atcgattggg gaatatgggt cagtacatat gtatttacaa tatgctttgc tgccaatgtg 1500 

ggactgctgt ttggtgttgt ttgtaccata gctatagtga taggacgctt cccaagagca 1560 

atgactgtaa gtataaaaaa tatgaaagwa atggaattta aagtgaagac agaaatggac 1620 

agtgaaaccc tgcagcaggt gaaaattatc tcaataaaca acccgcttgt tttcctgaat 1680 

gcaaaaaaat tttatactga tttaatgaac atgatccaaa aggaaaatgc ctgtaatcag 1740 

ccacttgatg atatcagcaa gtgtgaacaa aacacattgc ttaattccct atccaatggc 1800 

aactgcaatg aagaagcttc acagtcctgc cctaatgaga agtgttattt aatcctggat 1860 

tgcagtggat ttaccttttt tgactattct ggagtctcca tgcttgttga ggtttacatg 1920 

gactgtaaag gcaggagtgt ggatgtattg ttagcccatt gtacagcttc cttgataaaa 1980 

gcaatgacgt attatggaaa cctagactca gagaaaccaa ttttttttga atcggtatct 2040 

gctgcaataa gtcatatcca ttcaaataag aatttgagca aactcagtga ccacagtgaa 2100 

gtctgagacc cttttgtcac agtacagctc ttgtctttac caactgcctg aagaggccat 2160 

atgctggcat tttgcacaac tttttggttg tttagatcct acagatgacc tctgctacaa 2220 

taagtacgat gtgacttagt aactgcatag cagttggaaa gaactgccaa cttttttttc 2280 

tcatttttgt tagtaagaag attcgcttag ttattttatg taaaaatcag tatgtgttta 2340 

gttttagtgt actgaagggt aaacatggtt ttattttatt ttaccatatt attttgggtt 2400 

ggtttatttc tattgggctg 2420 
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